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Abstract 

The paper presents partial-realization theory and realization algorithms for linear 
switched systems. Linear switched systems are a particular subclass of hybrid sys- 
tems. We formulate a notion of a partial realization and we present conditions for 
existence of a minimal partial realization. We propose two partial-realization algo- 
rithms and we show that under certain conditions they yield a complete realization. 
Our main tool is the theory of rational formal power series. 
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1 Introduction 



The main objective of the paper 

The immediate objective of the paper is to present partial-realization theory 
of linear switched systems. However, the broader goal is to demonstrate that 

(a) it is possible to develop partial-realization theory for hybrid systems, 

(b) the theory of rational formal power series can be used to obtain partial- 
realization theory of hybrid systems. 
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The results of the paper just serve as evidence for the above claims. In fact, 
partial-realization theory can be developed along the same line for other classes 
of hybrid systems, [44] . This paper is intended as the first one in a series of 
papers dealing with partial-realization theory of hybrid systems. 

The class of linear switched systems 

Linear switched systems have been studied for almost two decades, see [35,51] 
for a survey. Their practical and theoretical relevance is widely recognized. 
Linear switched systems (abbreviated by LSS) are hybrid systems with ex- 
ternal switching only, whose continuous dynamics in each discrete mode is 
determined by a linear continuous-time system, and whose discrete dynamics 
is trivial, i.e. any discrete state transition is allowed. The continuous subsys- 
tems are defined on the same state-, input- and output-spaces. A discrete state 
transition occurs if the environment enforces one, i.e. the switching sequence 
itself functions as an input. The time evolution of a linear switched system 
can be pictured as follows. A discrete mode is selected and a continuous input 
is fed into the corresponding linear subsystem. The state and output of the 
linear switched system is now determined by the time evolution of the chosen 
linear subsystem. At some point in time, a new discrete mode is selected. Then 
the linear subsystem corresponding to the new discrete mode is launched, us- 
ing as initial state the state of the previous linear subsystem at the moment 
when a transition to the new mode occurred. From this point on, the state 
and output evolve according to the newly selected linear subsystem. 

Motivation 

We believe that similarly to linear systems, partial-realization theory for LSSs 
will be useful for systems identification and model reduction of LSSs and even 
of more general hybrid systems. We will elaborate on the possible applications 
in Section 8. 

Contribution of the paper 

The paper presents the following results. 

• Partial-realization algorithm and theorem for LSSs 

In Section 6 we propose a notion of partial-realization. In Theorem 3 and 4 
we formulate a sufficient condition for existence and minimality of a partial 
realization. We present two algorithms for computing a partial-realization. 
In one of them, Algorithm 1, a partial- realization is constructed from the 
columns of a finite sub-matrix Hankel-matrix, the other one, Algorithm 2, is 
based on finding a factorization of a finite sub-matrix of the Hankel-matrix. 
The outcomes of both procedures are isomorphic. The former is potentially 
useful for theoretical purposes, while the latter one might serve as a basis 
for subspace identification-like methods. The factorization algorithm was 
implemented and the implementation is available from the the first-named 
author on request. 
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• Realization algorithm for LSSs 

In Theorem 3 and 5 we show that the above partial-realization algorithms 
return a complete minimal realization of the input-output behavior, pro- 
vided the rank of the finite sub-matrix of the Hankel-matrix equals the 
rank of the full Hankel-matrix. We show that any finite sub-matrix whose 
size is above a certain lower bound has this property. 

• Partial-realization theory for families of formal power series 
Both the (partial-) realization algorithm and the corresponding theoretical 
results for LSSs follow quite easily from analogous statements for families 
of rational formal series. In Appendix B we present what can be termed 
as partial-realization theory and algorithms for families of formal power 
series. Although partial-realization theory of a single rational formal power 
series has been known in various forms [29,15,31,49,50,11,7], its extension to 
families of formal power series appears to be new. As it was already noted, 
this extension of the classical theory can be then used to develop partial- 
realization theory for a number of other (more general) classes of hybrid 
systems, [44]. 

Formal power series approach 

Our main tool is the theory of rational formal power series. Recall from 
[42,45,40] that there is a correspondence between LSS realizations and rep- 
resentations of certain rational families of former power series. Recall that the 
Hankel-matrix of a family of input-output maps is in fact the Hankel-matrix 
of the corresponding family of formal power series. Hence, by formulating a 
partial-realization theory and algorithms for rational families of formal power 
series, we immediately obtain partial-realization theory and algorithms for 
LSSs. 

Related work 

To the best of our knowledge, the only results on partial-realization of linear 
switched systems is the first-named author's thesis [44]. In [40,39] partial- 
realization theory of bilinear hybrid and switched systems are announced, but 
no proofs are provided. The thesis [44] contains most of the results of this 
paper. In addition, it also covers partial-realization theory of several other 
classes of hybrid systems. Reference [47] announces some results on partial- 
realization theory of stochastic jump-Markov systems. 

Realization theory of rational formal power series is a classical topic, see 
[49,50,30,2,34,12,13]. It is known to be closely related to realization theory 
of bilinear and state-affine systems [29,31,49,50]. In turn, results on partial- 
realization theory for discrete-time bilinear systems and state-affine systems 
can be found in [15,29,49,50,11,7]. With respect to [29,15,31,49,50,11,7], the 
main novelty of the paper is the following. 

(1) In this paper (in Appendix B) partial-realization theory is stated directly 
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for rational formal power series representations, without reformulating it 
in terms of some particular system class such as bilinear or state-affinc 
systems. 

(2) While the classical results [29,15,31,49,50,11,7] can be thought of as a 
reformulation of partial-realization theory for a single rational formal 
power series, here we deal with families of formal power series. 

(3) We apply partial-realization theory of rational formal power series to 
linear switched systems. This represents a novel result in the theory of 
hybrid systems. We view this as the main contribution of the paper. 

The statement of the main results on partial-realization theory for families of 
formal power series was announced [43,47,39,40,41,46], but no proof was ever 
presented. The thesis [44] contains the statement and the proof of the main 
results on formal power series. 

Outline of the paper 

The outline of the paper is the following. Section 2 presents an informal for- 
mulation of the partial-realization problem for LSSs. Section 3 discusses a 
numerical example. It is intended as an accessible explanation of the main 
results of the paper by means of an example. Section 4 presents the definition 
and some elementary properties of linear switched systems together with the 
basic notation and terminology. Section 5 provides a brief overview of the re- 
alization theory of linear switched systems. This section is a prerequisite for 
understanding the main results of the paper. Section 6 presents the formal 
statement of the main results of the paper. Section 7 presents the proof of the 
results stated in Section 6. Finally, in Section 8 we formulate the conclusions 
of the paper and we discuss potential applications of the presented results. As 
it was already remarked, partial-realization theory of LSSs is based on partial- 
realization theory of rational formal power series. Therefore, in Appendix A we 
included a brief overview of the extension of the theory of formal power series 
to families of formal power series. More precisely, Appendix A reviews the rel- 
evant concepts and results on formal power series from [42,40,45]. In Appendix 
B we present partial-realization theory of families of formal power series, and 
algorithms for computing a representation for a family of formal power series. 
Appendix B is in fact the main technical tool of the paper. It is a prerequisite 
for understanding the proof of the main results on partial-realization theory 
of LSSs. 



2 Informal problem formulation 

The goal of this section is to state the partial-realization problem for linear 
switched systems in an informal way. The next section, Section 3, provides a 
numerical example to illustrate the problem and the solution. We defer the 
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formal problem statement until Section 6. 
Brief review of linear-partial realization theory 

The partial-realization problem was originally formulated for linear systems. 
[33,28,52] as follows. Assume that the first N e N Markov parameters S = 
{Mi\f =1 of an input-output map / are specified. 

(a) Find conditions for existence of a linear system whose first N Markov- 
parameters coincide with S = {Mj}f =1 . Such a linear system is called a 
partial-realization of S. Characterize minimal partial-realizations. Find an 
algorithm for computing a (minimal) partial-realization. 

(b) Find conditions under which the thus obtained linear system is a minimal 
realization of the input-output map /. 

Notice that for discrete-time linear systems, the fcth Markov parameter coin- 
cides with the output of the system at time k for a particular input. Hence, 
the partial-realization problem can be seen as an identification problem. In 
fact, partial-realization theory of linear systems can be used for systems iden- 
tification and model reduction. 

Markov-parameters for LSSs 

Let $ be a family of input-output map which map continuous-valued inputs 
and switching sequences to continuous outputs. That is, the elements of $ 
are of the same form as the input-output maps generated by LSSs. Below 
we will define generalized Markov-parameters of a family $ of input-output 
maps which could potentially be realized by a LSS. The generalized Markov- 
parameters of $ are defined as certain high-order derivatives with respect 
to the switching times of the elements of $. More precisely, the Markov- 
parameters are indexed by triples consisting of the following components 

(a) elements of <£>, or pairs (q,j) where q runs through the set of discrete modes 
and j runs through the set {1,2,..., to}, where to denotes the number of 
continuous-valued input channels, and 

(b) sequences of discrete modes, and 

(c) discrete modes. 

Without going into details, the intuition behind the presented indexing is the 
following. A Markov-parameter of $ indexed by / G $, a discrete mode q, and 
a sequence of discrete modes qi, q 2 , ■ ■ ■ , qk, stands for the partial derivative 
of / with respect to the switching times, evaluated at zero, for the following 
switching scenario. The system goes through the modes qi, q 2 , • • ■ ,qk and then 
jumps to q. Here the continuous input is set to zero. The intuitive meaning of a 
Markov-parameter indexed by a pair (qo,j), a discrete mode q and a sequence 
of discrete modes qi, q 2 , ■ ■ ■ , qk is the following. Each input-output map / e $ 
can be written as a sum / = a/ + of two maps, where a/ is independent 
of the continuous input and is common for all the elements of <3> and it 
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is linear in continuous inputs. Roughly speaking, a/ accounts for the output 
from a certain initial state under zero input, and represents the input- 
output map induced by the zero initial state. The Markov-parameter indexed 
by the pair (q , j), discrete mode q and sequence q±, q 2 , ■ ■ ■ qu is the derivative 
of with respect to the switching times evaluated at zero, for the switching 
scenario where the system goes from mode go to ?i, ?2, • ' • 9fc and ends in q, 
and all the continuous input channels are except the jth one which is 1. 

Partial-realization problem for LSSs 

We will refer to a Markov parameter indexed by a sequence of discrete modes 
of length k as a Markov-parameter of order k. Fix a natural number N > 
and let S be the collection of all Markov-parameters of $ of order at most N. 
That is, S is simply a collection of high-order derivatives of the elements of $, 
such that the degree of derivation is bounded by N. A LSS £ is said to be a 
N -partial realization of $, if certain products of the matrices of £ are equal to 
the corresponding elements of S. In other words, E is a iV-partial realization 
of $ if the input-output maps of E and those of $ have the property that 
their derivatives corresponding to the Markov-parameters of order at most N 
coincide. From realization theory of LSSs [44,45,42] it follows that a LSS is a 
realization of $ if and only if it is a iV-partial realization of $ for all N E N. 
The partial realization problem for LSSs can be now stated as follows. 

• Find conditions for existence of a iV-partial realization of $ by LSS. Char- 
acterize minimal dimensional iV-partial realizations of <£>. Find an algorithm 
for computing a minimal iV-partial LSS realizations of <£>. 

• Find conditions under which a minimal iV-partial LSS realization of $ is a 
complete realization of $. 

The motivation for studying the partial-realization problem for LSSs is similar 
to that of for linear systems, i.e. we expect it to be useful for model reduc- 
tion and systems identification. In Section 8 we will present a more detailed 
description of the motivation and possible applications. 



3 Numerical example 

The purpose of this section is to demonstrate the main results of the paper by 
means of a numerical example. In this section we will tacitly use the notation 
and terminology of Section 4 and Section 6. 

Consider the linear switched system of the form 




(1) 
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where q(t) G {q±, ^2} is the discrete mode at time t, x(t) G R 5 is the continuous- 
state at t, y(t) 6 1 is the scalar output at t, and u(t) el is the scalar input at 
t. The system matrices A q , B q , C q describing the linear control system residing 
in a state (mode) q G {q±, q^} = Q are of the following form. 



A qi — 



A q2 — 
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Consider the initial states x\ = and x 2 = 1 • Consider the set of 

input-output maps $ = {fi, f'2} such that fi is realized by S from the initial 
state Xi, i = 1,2, i.e. fi(u,w) = yz(xi,u,w), for i — 1,2, for each continuous- 
valued input u G PC(T,U) and finite switching sequence w G (Q x T) + . 



Notice that $ can be realized by the following minimal linear switched system 



x(t) =A% t) x(t) + B™ (t) u(t) 
y(t) =C% t) x(t) 



(2) 
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where for each q e {91, 92}, the matrices A™, 5™, C™ are of the following form 
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More precisely, E m realizes /, from the initial states x™ 

-iT 

1 • 



for 



x 



and x™ 



1,2. Here, 



Since $ has a realization by a LSS, it is clear that it has a generalized kernel 
representation and hence its Markov-parameters can be defined. In order to 
give a better intuition on Markov-parameters of $, we have listed some of them 
in Table 1. Consider the upper-left sub-matrix H^ jK:L of the Hankel- matrix 
H$ of $. Recall that Hq> K,L is formed by the intersection of the columns of 
indexed by a sequence of discrete modes of length at most L, and by the rows 
of Hq> indexed by a sequence of discrete modes of length at most K. From 
Theorem 3 it follows that rank Hq, )K ,L < dim S m = 4 for all K, L > 4. In fact, 
it turns out that the Hankel-matrix H$ iNjN for iV = 2 has already rank 4. By 
Theorem 3 it means that we can already compute a minimal LSS realization 
of <3> from the generalized Markov parameters of $ of indexed by sequences of 
discrete modes of length at most 5 = 2 + 3. Applying Algorithm 2 to H^ Z 2 
yields the following minimal LSS realization of $. 



x{t) =A f m x(t) + B f m u(t) 
J ' S y(t) =C f q{t) x(t) 



(3) 



8 



Table 1 

Markov-parameters of $ 



(90, j) G ({91,92} x {!}) 


9 G Q 


w € Q* 


Markov-parameter S qm j(w) 


(92, 1) 


92 


e 





fe, 1) 


92 


9i 





(92, 1) 


92 


929292 





(92, 1) 


91 


e 


1 


(92, 1) 


91 


9i 





(92, 1) 


91 


92 


2 


(92, 1) 


91 


9292 


4 


(92, 1) 


91 


929292 


8 


(91,1) 


91 


e 





(91,1) 


91 


9i 





(91,1) 


91 


929292 





(91,1) 


92 


e 


1 


(91,1) 


92 


9i 





(91,1) 


92 


92 


3 


(91,1) 


92 


9292 


9 


(91,1) 


92 


929292 


27 


3 e $ 


9 G Q 


to G Q* 


Markov-parameter Sj >q (w) 


/i 


91 


e 





/i 


91 


9i 





/2 


91 


e 





h 


91 


9i 





h 


92 


e 


1 


h 


92 


9i 





h 


92 


92 


3 


h 


92 


9292 


9 


h 


92 


929292 


27 



9 



where the system matrices are of the form 



A f 



A f 




0.79 
-0.49 
-2.55 
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B f = 
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The LSS £/ realizes the input-output map f\ from the initial state x{ = 
(0, 0, 0, 0) T , and the map / 2 from the initial state x{ = (1-46, 0, 0, 0) T . 

However, it turns out that rank H$ 10 = rank H^ 01 — rank i?$,o,o- That is, 
the sequence of Markov parameters of $ indexed by sequences of discrete 
modes of length at most 1 already satisfies the partial realization theorems 
Theorem 3 and Theorem 5. Hence, we can apply Algorithm 2 to i?$,i,o to 
obtain a minimal LSS 1-partial realization of $. However, it can be checked 
that the thus obtained LSS is not a LSS realization o/$. Indeed, by applying 
Algorithm 2 to H$ :lj0 we obtain the following LSS realization of $ 



x(t) =A%«x(t) + Bffiu(t) 



(4) 



where the system matrices A part , B vari : , C part : , g G {gi, g 2 } = Q are as follows 
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The LSS Spar* is a partial realization of the input-output map f\ from the 
initial state Xi art = (0, 0) T , and of the input-output map f 2 from the initial 
state x p 2 art = (-1,5,0) T . That is, the Markov parameters of $ = {/i,/ 2 } 
which are indexed by sequences from the set {e, qi, q 2 } coincide with those of 



the input-output maps induced by the initial state x 



part 



1,2. It is easy 



to see that S part is not a realization of the input-output maps fi from the 
respective initial states x? art , = 1,2. That is, the input-output maps induced 
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by the respective states x v °" r , i — 1, 2, do not coincide with the maps One 
can either check it by direct calculation, or by using uniqueness of a minimal 
realization. Using the latter approach, it is enough to notice that S m is a 
minimal realization of $ and it is of dimension 4. Since S part of dimension 2, 
hence smaller than the dimension of S m , and all minimal realizations have to 
be of the same dimension, it follows that S part cannot be a realization of <3>. 

In fact, by checking the Markov-parameters, one can see that T> part recreates 
only the Markov-parameters indexed by sequences of length at most 1, but 
there is a Markov-parameter of $, indexed by a sequence of length 2, which 
is not generated by S part . That is, it indeed happens that a family of input- 
output maps generated by a LSS satisfies the sufficient conditions for existence 
of a iV-partial realization for some N, but the obtained partial realization is 
not a complete realization of the family of input-output maps. 



4 Linear switched systems 

This section contains the definition of linear switched systems. We will start 
with fixing notation and terminology which will be used throughout the paper. 
The notation used in this paper is mostly the standard one used in the field 
of control theory and formal language theory. In order to make the task of the 
reader easier, below we will list the most important notational conventions, 
grouped according to the disciplines. 

4-0.1 Notation from general mathematics and control theory 

Denote by T the set [0, +oo) C R of all non-negative reals. The set T will be 
the time-axis of the systems discussed in this paper. For any m > 0, denote 
by PC(T, M m ) the class of piecewise-continuous maps from T to R m . That 
is, / G PC(T, R m ), if / has finitely many points of discontinuity on each 
finite interval [0,t], t G T, and at each point of discontinuity the right- and 
left-hand side limits exist and they are finite. Denote by N the set of natural 
number including 0. By abuse of notation we will denote any constant function 
/ : T ->■ R m by its value. That is, if /(*) = a G M m for all t G T, then / will 
be denoted by a. For any function g the range of g will be denoted by Img, 
i.e. if g : A — > B for some sets A and B, then Img = {g(a) G B \ a G A}. If X 
is a vector space and Z is a subset of X, then SpanZ denotes the linear span 
of elements of Z in X. If X, y, Z are vector spaces over R, and Fi : X — > y, 
F 2 : y — > Z are linear maps, then FiF 2 denotes the composition Fi and F 2 . 

Let : R k ->■ W xm be a smooth map. Consider a k tuple of natural numbers 
a = (ai, a 2 , ■ ■ ■ , ct k ) G where cci, . . . , a k G N. We will denote D a 4> the 
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partial derivative of 4>(ti,t2, ■ ■ ■ ,tk) evaluated at zero, such that the order of 
derivation with respect to the variable ij is aij for i — 1, . . . , k. That is, 



D a (f) = 



d ai d a2 
dt? dtf 



d ak 



4>(ti, t 2 , . . . , t k ) | tl=t2= ... = t fc= o- 



For each i = 1,2, ...,n, ej denotes the jth unit vector of R ra , i.e. ej = 
(Sij, S 2 ,j, • • • , S n j), where 8 it j is the Kronecker symbol. 



4-0.2 Infinite matrices 

In this paper we will use the notation of [32] for matrices indexed by sets other 
than natural numbers. Let / and J be two arbitrary sets. A (real) matrix M 
whose columns are indexed by the elements of J and whose rows are indexed 
by the elements of / is simply a map M : I x J — > R. The set of all such 
matrices is denoted by R /xJ . The entry of M indexed by the row index j £ I 
and column index j G J is denoted by Mij and it is defined as the value 
of M at (i,j), i.e. M i:j = M{i,j). The case of usual finite matrices can be 
recovered by viewing n x m real matrices as matrices from R{ 1 ' 2 '---' n } x 't 1 ' 2 '---' rn }. 
In the sequel, when referring to the index set of a matrix, we will identify any 
natural number n with the set {1,2,..., n}. In other words, M. Ixn denotes the 
set of matrices ]R Jx I 1 ' 2 '-' n } and R nxJ denotes the set of matrices Rl 1 ' 2 '-' n } xJ . 

For a matrix M e M 7x J , the columns of M are simply maps of the form I — > R 
and the rows of M are maps of the form J — > R. The set of maps of the form 
I — > R and J — > R will sometimes be denoted by R 7 and R J respectively. 
Furthermore, if g G R 7 (resp. g G R J ) then the value of g at % G / (resp. j G J) 
will be denoted by gi (resp. gj). The column of M indexed by j G J will be 
denoted by M.j and is defined as Mj(i) = Mjj, i £ I. Similarly, the row if 
M indexed by % G / will be denoted by Mj v and is defined as Mi^(j) = Mij 
for all j G J. If M G R /xJ and 5 G R JxK and J is finite, then the product 
of M and S is the matrix MS G R /x/< such that (MS) itk = E iG j ^j^.fe for 
alH G /, k G X. In particular, if M G R /Xr and 5 G R^ xX for some natural 
number r G N, then their product M5 1 is well-defined and it belongs to R /xis: . 

We will identify a map / G R J with the matrix / G R Jxl defined as /^i = fj 
for all j G J. Hence, for a matrix M G R /xJ , the product Mf is defined 
as the following matrix in R /xl ; (Mf) i:1 = J2jeJ ^i,jfj- ^ n addition, we will 
occasionally identify the rows of a matrix M G R /xJ with matrices R lxJ . 
That is, the row Mj v of M indexed by % G / will be viewed as the matrix 
Mj r : {1} x J 3 (1, j) 1 — y Mij. With this identification, the product of the row 
Mj r with / is a scalar Mj v / = J2jej ^i,jfj e ^- Notice that here we tacitly 
assumed that J is finite. 
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Notice that the set of all maps R 7 forms a vector space with respect to point- 
wise addition and multiplication by scalar. That is, if /, g G R 7 and a,/3 G R, 
then the linear combination af + f3g : I — > R is defined by (a/ + Pg)(i) = 
a f(i) + f° r & H i £ I- Consider a matrix M G R 7xJ and recall that its 

columns are simply elements of R 7 . Hence, it makes sense to speak of the 
linear subspace spanned by the columns of a matrix M G R 7xJ . In the sequel, 
the rank of M, denoted by rank M G NU {00} will mean the dimension of the 
linear space spanned by the columns of M. If this dimension is not finite, then 
the rank is taken to be 00. We will denote by ImM the linear space spanned 
by the columns of M. 

If M G R 7xJ and J is finite, then M can be viewed linear map from R 
to R 7 , defined by (M/)(i) = E j& J M i,jfj = M iJ, i e /, for each / G R J . If 
S G R JxK and K is finite, then the product MS G R IxK corresponds to the 
linear map R^ — > M. 1 obtained by composing the linear map corresponding to 
S with the linear map corresponding to M. 



4-0.3 Notation from the theory of formal languages 

The notation described below is standard in formal languages and automata 
theory, see [14,10]. Consider a finite set X which will be called the alphabet. 
Denote by X* the set of finite sequences of elements of X. Finite sequences 
of elements of X will be referred to as strings or words over the alphabet X. 
For a word w = • • • a& G X*, ai, 02, . . . , G X, k > the length of w is 
denoted by \w\, i.e. \w\ = k. We will denote by e the empty sequence (word). 
The length of the empty sequence e is zero: |e| = 0. We will denote by X + 
the set of of non-empty words over X. That is, X + = X* \ {e}. Consider two 
words w G X* and v G X* of the form v = V\Vi ■ ■ ■ Vk, and w = W1W2 ■ • • w m , 
v±, i>2, . . . , Vk, Wi,ui2, . . • , w m G X. Define the concatenation vw G X* of the 
words v and w as the the word vw = v±v 2 ■ ■ ■ VkWiW2 ■ ■ ■ w m . In particular, if 
v — e, i.e. if k = 0, then vw = w. Similarly, if w — e, i.e. m = 0, then vw = v. 
If w G X + is a word, then w k denotes the word ww ■ ■ ■ w . The word w° is just 

fc— times 

the empty word e. 



^.i Definition and basic properties of linear switched systems 

Below we present the formal definition of LSSs. For a more detailed exposition, 
see [51,35,45,44,42]. 

Definition 1 (Linear switched systems) A linear switched system (ab- 
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breviated by LSS) is a control system E of the form 

f x(t) =A q{t) x{t) + B q{t) u{t) 
■ \ y(t) =C q(t) x(t) { } 

• x(t) G X is the continuous state at time t G T, 

• u(t) G U denotes the continuous input at time t G T ' , 

• q(t) G Q denotes the discrete mode (state) at time t, 

• y(t) G M p denotes the continuous output at time t G T . 

• The state-space is X = R™ ; the input-space is U = M. m , the output-space 
is y = W, and Q is the finite set of discrete modes (discrete states). Here 
n,m,p are positive integers. 

• For each discrete mode q G Q, the corresponding matrices are of the form 
A q G W nxn , B q G M nxm and C q G W xn . 

We will use (X,U,y,Q,{(A q ,B q ,C q ) \ q G Q}) as a short-hand notation for 
LSSs of the form (5). 

Notation 1 (Notation for the spaces of inputs and outputs) In the se- 
quel we denote by U the space W 11 of continuous-valued input, by y the space 
W of continuous-valued outputs, and by Q the set of discrete modes. 

Informally, the system (5) evolves as follows. For a piecewise-constant switch- 
ing signal q{.) : T — > Q and piecewise-continuous input u : T -> U, the state 
trajectory x : T — > U is a continuous piecewise-differentiable function which 
satisfies the differential equation (5). The output at time t G T is obtained 
by applying to x(t) the readout map (matrix) C q ( t )- Below we define state- 
and output- trajectories more rigorously. To this end, we define the notion of 
switching sequences. 

Definition 2 (Switching sequences) A switching sequence is a sequence 
of the form w = (qi,ti)(q 2 , t 2 ) ■ ■ • (qk, t k ), where q±, q 2 , ■ ■ ■ , qu G Q are discrete 
modes and ti,t 2 , ■ ■ ■ ,t k denote the switching times and k > 0. The set of all 
switching sequences is denoted by (Q x T)* . If k = then we say that w is 
the empty switching sequence and we denote it by e. We denote the set of all 
non-empty switching sequences by (Q x T) + . 

The interpretation of the sequence w = {qi,t\)(q 2 ,t 2 ) ■ ■ ■ (qk,tk) is the follow- 
ing. From time instance to time instance t\ the active discrete mode is qi, 
i.e. the value of the switching signal is q 1: from t 1 to ti + t 2 the value of the 
switching signal is q 2 , from t\ + t 2 to t\ + t 2 + t 3 the value of the switching 
signal is ^3, and so on. That is, the non- negative real ti indicates the time 
spent in the discrete mode qi, for all % — 1, 2, . . . , k. In this paper the switching 
sequences are regarded as inputs and we allow any switching sequence to occur. 
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Definition 3 (State of LSSs) Let u G PC(T,U) be a continuous-valued in- 
put and let w = (qi, ti)(q , 2 ? t 2 ) ■ ■ ■ (qk, tk) G (Q x T) + fre a non-empty switching 
sequence. The state of E reached from the initial state x & X with the inputs 
u and u> zs denoted by x-z(x , u, w) and it is defined recursively on k as follows. 

• If k = 1, then x-s(xo,u, (qi,t\)) is the solution at time t\ of the differential 
equation 

x(s) = A qi x(s) + B qi u(s) 

with the initial condition x(0) = xq. 

• U x ^( x o, u , (Qi,ti)(q2,t2) • • • (qk-i,tk-i)) is already defined, then let x-s(xo,u,w) 
be the solution at time tk of the differential equation 

k-i 

x(s) = A qk x(s) + B qk u{s + tj) 

j'=i 

with the initial condition x(0) = Xj](xo,u, (qi,ti) ■ ■ ■ (qk-i,tk-i))- 

That is, the states evolves according to the differential equation determined by 
the discrete mode. If a switch occurs, then the state at the time of the switch 
is taken as the initial condition for generating a solution to the differential 
equation associated with the new discrete mode. As a next step, we will define 
the output trajectories of LSSs. 

Definition 4 (Output of LSSs) Consider a continuous-valued input u G 
PC(T,U) and a non-empty switching sequence w = (qi, ii)(?2, £2) • • • (Qk, tk) £ 
(Q x T) + . The output generated by the LSS S if it is started from initial 
state Xq G X and fed with the inputs u and w, denoted by 2/s(x , u, w) G y, 
is defined by 

y s (x ,w,u>) = C qk xx(x ,u,w) (6) 

That is, the current output is obtained from the current continuous state 
xs(xo, u, w) by the application of the readout map C Qk associated with the 
current discrete mode qk- We define the input-output map of a LSS induced 
by a particular initial state as follows. 

Definition 5 (Input-output maps of LSSs) Consider a state x G X of 
E. Define the input-output map of E induced by the state Xq as the map 
Uz( x o, •) : PC(T,U) x (Q x T) + — >■ y such that for all continuous-valued 
input u G PC(T,U) and for all non-empty switching sequence w G (Q x T) + 

yv(x ,.,.)(u,w) = yz(xo,u,w) (7) 

That is, the value of y^(x , ., .) at (u,w) equals the output generated by E if 
started from the initial states x and fed inputs u and w. By abuse of notation 
we will denote yx{xo, ■, -)(u, w) by ys(^o, u, w). 



15 



5 Overview of realization theory for LSS 

The goal of this section is to present a brief overview of realization theory 
of LSSs. In Subsection 5.1 we recall the definition of input-output maps of 
LSSs and the notion of a LSS realization. In Subsection 5.2. we review a 
number of system-theoretic concepts such as observability, semi-reachability, 
minimality and LSS morphisms. In Subsection 5.3 the realization problem 
will be formulated. Subsection 5.4 recalls the concept of generalized kernel 
representation of a family of input-output maps. Subsection 5.5 presents the 
definition of Markov-parameters for LSSs. Finally, Subsection 5.6 presents the 
main results on realization theory of LSSs. For a more details on the material 
of this section see [44,42,45]. 

5.1 Input-output maps 

In this section we define the class of maps which represent potential input- 
output maps of LSSs. In addition, we introduce the notion of a LSS realization. 

Definition 6 (Input-output maps) In this paper, unless stated otherwise, 
an input-output map will mean a map of the form f : PC(T, U)x(QxT) + — > 
y. The set of all such maps will be denoted by F{PC(T,U) x (Q x T) + ,y). 
A family of input-output maps is just a (possibly infinite) subset of the set of 
all input-output maps F(PC(T,U) x (Q x T)+,y). 

That is, an input-output map maps continuous-valued inputs and non-empty 
switching sequences to continuous-valued outputs. In order to formalize the 
notion of a realization by a LSS of a family of input-output maps we will adopt 
the following formalism. 

Definition 7 (Realization of input-output maps) Consider a set $ C 
F(PC(T,U) x (Q x T) + ,y) of input-output maps. The family $ is said to 
be realized by a LSS £ of the form (5) if there exists a map //:<&—)■ X , which 
maps each input-output map f from $ to a state fi(f) o/E, such that f equals 
the input-output map induced by [x(f), i.e. 

Wu e PC(T,U),w e (Q x T)+ : yMf),u,w) = f(u,w) (8) 

One can think of the map ji as a way to determine the initial state correspond- 
ing to each element of $. 

Definition 8 (Realizations) Let $ be a family of input-output maps. We 
will refer to the pairs (S,/i) 7 where X is a LSS of the form (5) and //:<&—)■ 
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X is a map mapping elements of $ to the states of E ; as LSS realizations 
( realizations for short). The realization (E,/x) is said to be a realization of $ ; 
if (8) holds for all f G $. 

Note that not any realization (E, /x) with /i : $ — > X is a realization of $. The 
statement that (E, \x) is a realization only expresses the fact that we associate 
a state of E with each element of $. However, we do not yet require that 
the input-output map induced by a designated state equals the corresponding 
element of $. The latter is required only if we claim that (E, /i) is a realization 
of®. 

5.2 System-theoretic concepts 

The goal of this section is to define system theoretic concepts such as ob- 
servability, span-reachability, system morphism, dimension and minimality for 
LSSs and for LSS realizations. Throughout the section E denotes a LSS of the 
form (5). 

The reachable set of E from a set of initial states Xq C X is defined as 

Reach(Z, X ) = {x^{x ,u,w) G X | u G PC(T,U),w G (Q x T) + ,x G X } 

That is, i?eac/i(E, X ) is the set of all those states which are obtained by 
starting the system from an initial state in Xq, applying some continuous- 
valued input and some finite switching sequence, and considering the state at 
the last switching time. 

Definition 9 ((Semi-) Reachability) The LSS E is said to be reachable 
from X if Reach(Y>, X ) = X holds. The LSS E is semi-reachable from X if 
X is the smallest vector space containing Reach(T>, X ) . 

In other words, E is semi-reachable from X if the linear span of the elements 
of the reachable set Reach(Yt, Xq) yields the whole state-space X . We proceed 
with defining the notion of observability for LSSs. 

Definition 10 (Observability and Indistinguishability) Two states x\ ^ 
x 2 G X of the LSS E are indistinguishable if the input- output maps in- 
duced by Xi and x 2 coincide, i.e. for all u G PC(T,U) and w G (Q x T) + , 
Uz( x i,u,w) = yz(x 2 , u, w). The LSS E is called observable if it has no pair 
of distinct indistinguishable states. 

In other words, observability means that if we pick any two states of the 
system, then we are able to distinguish between them by feeding a suitable 
continuous- valued input and a suitable switching sequence and then observing 
the resulting output. 
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Below we will define the notion of dimension for LSSs. 

Definition 11 (Dimension of LSSs) Define the dimension of E ; denoted 
by dimE, as the dimension dim X = n of is state-space. 

Now we are ready to define the concept of minimality for LSSs . 

Definition 12 (Minimality of LSSs ) Let $ be a family of input-output 
maps and let (E, jj) be a LSS realization o/$. (E,/i) is a minimal realization 
of $, if for any LSS realization (E,/2) o/$, dimE < dimE. 

In simple words, a LSS realization is a minimal realization of $ if it has the 
smallest dimensional state-space among all the linear switched systems which 
are realizations of $. 

The notions of observability and semi-reachability can be extended to LSS 
realizations as follows. 

Definition 13 (Observability and semi-reachability of realizations) Let 

$ be a family of input-output maps and let fi : $ — > X be a map from $ to 
the state-space o/E. The realization (E,/z) is semi-reachable , if E is semi- 
reachable from the range lm.fi of \i. The realization (S,yu) is observable, if E 
is observable. 

As the next step, we will define the notion of a LSS morphism. 

Definition 14 (Linear switched system morphism) Consider a LSS E x 
of the form (5) and a LSS E 2 of the form E 2 = (X a ,U, 3>, Q, {(A a q , B a q , C°) | 
Q £ QjjZI- A linear map S : X — >■ X a is said to be a LSS morphism from E x 
to Tj2, and it is denoted by S : Ei — > E 2; if for all discrete modes q G Q, 

AyS = SA q , 13® = SBg, CgS = C q (9) 

The map S is called surjective ( injective ) if it is surjective ( injective ) as a 
linear map. The map S is said to be a LSS isomorphism, if it is an isomor- 
phism as a linear map. Consider two LSS realizations (Ei,// X ) and (E 2 ,/i2) 
such that the domain of definition of both Hi and /i 2 is a certain family $ of 
input-output maps. A LSS morphism S : E x — > E 2 is called a LSS morphism 
from realization (Ei,/ii) to (E 2 ,/x 2 ), if S o fi l = fi 2 holds, or, in other words, 
if for all f G $, S(/ii(f)) = ^{f)- The fact that S is a LSS morphism from 
(Ei,/ii) to (E 2 ,/i 2 ) will be denoted by S : (E]_,/ii) —> (E 2 ,/i 2 ). The LSSs real- 
izations (Ei, fix) and (E 2 , /i 2 ) are said to be algebraically similar or isomorphic 
if there exists an LSS isomorphism S : (E^/ix) — > (E 2 ,/z 2 ). 



Notice that the LSSs Si and S 2 have the same set of discrete modes. 
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5.3 Realization problem 



The realization problem for LSSs can be formulated as follows. 

Problem 1 (Realization problem for LSSs ) Find necessary and sufficient 
conditions for existence of a LSS realization for a family of input- output maps 
<3>. Find a characterization of minimal LSS realizations of <3>. Determine if 
minimal realizations of <E> are unique in any sense. 

5.4 Generalized kernel representation 

In this section we will recall the notion of generalized kernel representation. 
It turns out that a family of input-output maps can be realized by a LSS 
only if it admits a generalized kernel representation. Informally, a family $ 
of input-output maps has a generalized kernel representation if the following 
hold. 

(1) There exists an input-output map such that for all / £ $, f(u,w) = 
f(0,w) + y®(u,w) for all continuous-valued inputs u and switching se- 
quences w. 

(2) Each element / of $ is affine in continuous-valued inputs and analytic in 
switching times for all constant inputs. 

A good intuition for the notion of generalized kernel representation can be 
derived by analogy with input-output maps of linear systems. Recall from [6] 
that an input-output map y : PC(T,U) x T — >■ y can be realized by a linear 
system (A, B, C) £ R nxn x R nxm x W xn from the initial state x £ R n , only 
if there exist analytic functions K : T ->■ R p and G : T ->■ R pxm such that 

y(u, t) = Kit) + f Git - s)uis)ds (10) 
J 

More precisely, in this case 

Kit) = Ce At x and G(t) = Ce At B (11) 

Using the setting above, the maps and G* to be defined below are anal- 
ogous to the map K and G respectively. The formal definition is as follows. 

Definition 15 (Generalized kernel-representation) Consider a family of 
input-output maps as defined in Definition 6. The family $ is said to have 
generalized kernel representation, if for all input-output maps f £ $ and for all 
non-empty sequences of discrete modes w = qiq 2 ■ ■ ■ qu £ Q + , Qi,Q2 ■ ■ ■ ,Qk £ Q, 
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k > 0, there exist functions 

K f f :T k and : T fc -> W xm 

such that the following holds. 

(1) For each word w G Q + and for each input-output map / € <&, t/ie func- 
tions K^f* and G% are analytic. 

(2) For each input-output map / G $ and for each (possibly empty) se- 
quences w,v G Q* , and for each discrete mode q G Q, it holds that for all 

h, t 2 , ■ ■ ■ ,t\ w \,t,t,t\ w \ +2 , ■ ■ ■ ,t\ 

w\+\v\+l G J ; 

Kwqqvitlifoi • • • i t\w\,t, t, t\ w \ + 2, ■ ■ ■ t\ w \+\ v \+i) = 
= Kj^ v {t-y,t2, ■ ■ ■ t\ w \,t + t, t\ w \ +2 ■ ■ ■ t\ w \+\ v \+i) 
G% qqv {tlit2i ■ ■ ■ , t\ w \,t, t, t\ w \+2, ■ ■ ■ t\ w \+\ v \+i) = 

— G% qv {ti,t 2 , ■ ■ ■ t\ w \,t + i, t\ w \ + 2 ■ ■ ■ t\ w \+\ v \+i) 

(3) For each pair of sequences v,w G Q* such that and w is not the empty 
word, i.e. \w\ > 0, and for each discrete mode q G Q, the following holds. 
For each input-output map / G $ and for each t 1 ,t 2 , ■ ■ ■ , t\ vw \ G T , 

Kvqwitlihi ■ ■ ■ ,*H,0,*H+1, • • • i*h) = K vw (*1)*2, • • • ,t\vw\) 

For each pair of words v , w G Q* such that both v and w are not empty, 

i. e. \v\ > 0, \w\ > 0, and for each discrete mode q G Q, the following 
holds. For each t±, t 2 , ■ ■ ■ , t\ vw \ G T, 

Gy qw (tl : t2 : ■ ■ ■ , t\ v \, 0, t\ v \ + i, . . . , t\ wv \) = G® w (ti ) t 2) ■ ■ ■ , t\ vw \) 

(4) For each input-output map f G for each non-empty switching se- 
quence w = {qi,t 1 )(q 2 ,t 2 ) ■ ■ ■ {q k ,t k )(Q x T) + , where q u q 2 , . . . , q k G Q 
and t 1 ,t 2 , . . . ,t k G T, each piecewise- continuous input u G PC(T,U), the 
following holds. 

f(u,w) = K^ 2 ... qk (t u t 2 ,...,t k ) + 

k rU 

E L Gl qi+1 ... qk (ti - s,t i+1 , . . . ,t k )u(s + J2tj)ds 

i=l JU j=l 

The reader may view the functions as the part of the output which 

depends on the initial condition and the functions G% as functions determining 
the dependence of the output on the continuous inputs. 

In fact, let (£,//) be a LSS realization of $ and assume that S is of the form 
(5). It is easy to see that for all / G $ and for any input u G PC{T,U) and 
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switching sequence (qi,t 1 )(q 2 ,t 2 ) • • • (q k ,t k ) G (Q X T) + , 

yMf),u,w) = C qk e A ^e A ^-^ ■ ■ • *>(/)+ 

Jo x 

/■** i fe ~ 2 ( 12 ) 

+ C^e^*** / e^-i^- 1 -^.^^ + ]T + • • • 

• • • + CLe^e^*-!**- 1 • • • e'Mfc-'Jfl u ( s )d s 

Jo 

From the equation above it is easy to see that $ admits a hybrid kernel 
representation of the form 

JM, ... ,t k ) =C qk e A ^e A ^^ ■ ..e A ^B qi 
K^ q ,.. qk (h,t 2 ,...,t k ) =C qk e A ^e A ^^---e A ^^(f). 1 J 

for all non-empty sequences of discrete modes q±, q 2 , ■ ■ ■ , q k £ Q, k > 1. 

5. 5 Generalized Markov parameters for LSS 



Next we will define the notion of Markov parameters for input-output maps. 
Markov-parameters play a central role in (partial) realization theory of LSSs. 

Before proceeding further, recall from classical linear systems theory [6] the 
notion of Markov parameter. Consider the linear input-output map of the form 
(10) and define the Markov parameters of this map as derivatives of the maps 
K : T -> W and G : T -> W* m ; M k = ^G(t)\ t=0 and N k = £,K(t)\ t=0 for 
all k > 0. In turn, the derivatives of G(t) and -KT(t) can be expressed as the 
derivatives of the input-output map as follows. For a constant input u E U 
define the map 

f u :T3t^ Kit) + ( / Git- s)ds)u 

Jo 

That is, f u (t) is just the value of the input-output map (10) at time t if a 
constant input u is fed in. Then f (t) = K(t) and hence N k = -p-/o(£)|t=o- 
Similarly, the jth column of M k can be written as ^+r\fej\t)~ fo(t))\t=o where 
e 3 -, j — 1, 2, . . . , m is the jih unit vector of U = R m . That is, the Markov- 
parameters of a linear input-output map are the high-order time derivatives of 
the output trajectories induced by certain constant inputs, evaluated at zero. 

For LSSs the Markov-parameters are defined in a similar way. Before proceed- 
ing to the definition, we need the following notation 

Notation 2 (Input-output maps as time functions) Consider an input- 
output map f as in Definition 6, a non-empty sequence of discrete modes 
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w = <?i<?2 ■ ■ ■ % G Q + , Qi, Q2, ■ ■ ■ , Qk £ Q, k > 1 and a continuous-valued input 
u G PC{T,U). Define the map f u>w :T k ^y as follows 

/ti,i«;(*i,f 2, •••,**) = f(u,(q 1 ,t 1 )(q 2 ,t 2 ),...,(qk,tk)) (14) 



That is, the values of f UyW are obtained from the values of / by fixing the a 
piecewise-continuous input u and a sequence of discrete modes w and varying 
the switching times only. 

Definition 16 (Markov-parameters of $) LetQ be a family of input- output 
maps admitting a generalized kernel representation. The Markov parameters 
of $ are the vectors S q0tq j(w) G W and Sf tq (w) G MP, defined for all discrete 
modes q ,q G Q , all the sequences of discrete modes w G Q* (including the 
empty sequence), all j = 1,2, ... ,m and f G $ as follows. If w = q\q 2 ■ ■ ■ qk 
for k > 0, qi, q 2 , . . . , qk G Q, then 

Sf,q( W ) = ^~^~ ' ' ' ^/0,5l52-"<?fc<?(*l>*2, • • • ,t k ,0)\ tl = t2 -t k =0 
Sq,qo,j( W ) = ^~d^~dj~ ' ' ' ^^•^ e J' < ?0<?l<?2---gfc<?(^0) tl, t 2 , . . . , tk, 0)| to=tl= t 2 ... tt . = o 

d d d d , 
_ ~a\~d^dt 2 ' ' ' '^f k ■' ' qoqlq2 "' qkq ^ 0, 1: ' ' ' ' ^)l*o=ti=*2-*fc=o 

Here tj is the jth unit vector ofU — M m . 

The vectors Sf, q (w) correspond to the parameters Nf~ for linear systems, and 
Sq,q ,j( w ) corresponds to the jth column of the M k component of the Markov 
parameters. As it was shown in [44,42,45], there is a close relationship between 
the Markov parameters of $ and products of system matrices. 

Lemma 1 ([44,45,42]) Let E be of the form (5) and let fj, : $ ->■ X . The 

realization (S,/i) is a realization of $ 7 if and only if the Markov-parameters 
of $ are equal to the following products of the system matrices. 

Sq, qo ,j{qiq2 ■ ■ ■ Qk) = C q A qk A qk _ x ■ ■ ■ A qi B qo ej 
Sf, q (qiq2 ■■■q k ) = C q A qk A qk _ x ■ ■ ■ A qi fi(f) 

for each q,qo G Q, for each q±,q 2 , . . . ,q k G Q, k > 0, / G $ and j = 
1,2,..., m. 

Similarly to the linear case, the Markov-parameters of $ can also be repre- 
sented as derivatives of the maps and G%, see [44,42,45]. 



22 



5. 6 Main results on realization theory for LSSs 



The purpose of this section is to present formally the main results on realiza- 
tion theory of LSSs . To this end, the notion of the Hankel-matrix H$ of $ is 
defined. Similarly to the linear case, the entries of the Hankel-matrix will be 
formed by the Markov parameters. 

Definition 17 (Hankel-matrix) Assume that the set of discrete modes Q 
has D elements and choose an enumeration of Q 

Q = {<7i,<7 2 , . . . ,a D } (16) 

Recall the notation for infinite matrices presented in Section 4-0.2. Define the 
Hankel-matrix of $ as the infinite real matrix, columns and rows of which are 
indexed as follows. The rows of are indexed by pairs (v,i) where v G Q* 
is a finite sequence of discrete modes and i G {1,2,... ,pD}. The columns of 
H$ are indexed by pairs (w,j), where w G Q* is a sequence of discrete modes 
and j is either an element of $ ; or j is a pair of the form (q, z), where q is a 
discrete mode and z G {1, 2, ... , m). That is, H§ is an infinite real matrix of 
the form 

H*eR( Q * XI ^ Q * XJ ^ (17) 

where I = {1,2,... ,pD} and J$ = <E»U (Q x {1, 2, . . . , m}). The entries of 
are defined as follows. Fix sequences of discrete modes w,v G Q* and fix an 
element j G J$. For any % of the form i = pK + r + 1 where K — 0, 1, . . . ,D — 1 
and r = 0, 1, . . . ,p— 1, the entry (i?$) is defined as follows 

,, , (S aK+uq , z (wv)) r+1 ifj = (q,z) G Q x {l,...,m} 

l-n*;(«,i),(toj) = \ (1°) 

{Sf,a K+ A wv ))r+i ifj = fe& 

Here (S aK+1>q>z (wv)) r+1 and (S f >(TK+1 (wv)) r+ i denote the r + 1th element of the 
vectors S aK+1>q>z (wv) G M p and Sf >rTK+1 (wv) G W respectively. 

That is, H<$, is constructed from certain high-order derivatives of the input- 
output maps belonging to $. As it was noted in Section 4.0.2, the columns 
of Hq, belong to the vector space of all maps (Q* x {1,2, ... ,pD}) — > R. 
Hence, we can speak of the linear span of the columns of H<$,. In addition, 
according to the convention adopted in Section 4.0.2, the rank of H$, denoted 
by rank Hq> G N U {+oo}, is the dimension of the linear subspace spanned 
by the columns of Now we are ready to state the main theorem on the 
existence of a LSS realization for arbitrary switching. 

Theorem 1 (Realization of input-output maps, [44,45,42]) Let& be a 
family of input-output maps. Then $ has a realization by a LSS if and only 
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if $ has a generalized kernel representation and the rank of the associated 
Hankel-matrix Hq, o/$ is finite, i.e, rank H$ < +00. 

Next, we state state the main result of on minimality of LSSs . 

Theorem 2 (Minimality, [44,45,42]) // (£,//) is a LSS realization of <E> ; 
then the following are equivalent. 

(i) (£,//) is a minimal LSS realization 0/$. 

(ii) The realization (£,//) semi-reachable and it is observable. 

(Hi) The state-space dimension ofH equals the rank of the Hankel-matrix of <£>, 
i.e. dimS = dimiJ$. 

In addition, all minimal LSS realizations of $ are isomorphic. 



6 Main results of the paper 

The purpose of this section is to present the main results of the paper for- 
mally. The outline of this section is as follows. In Subsection 6.1 we state the 
partial-realization problem for LSSs formally. In Subsection 6.2 we define the 
notion of Hankel sub-matrix, which will be needed for the statement of the 
main results. In Subsection 6.3 we present the theorem characterizing the ex- 
istence and minimality of partial realizations. In addition, we present a simple 
algorithm for computing a minimal partial realization. Finally, in Subsection 
6.4 we present a Kalman-Ho-like algorithm for computing a minimal partial 
realization. 



6.1 Partial-realization problem 

Let $ be a family of input-output maps admitting a generalized kernel repre- 
sentation. As we saw earlier, the Markov-parameters uniquely determine the 
input-output maps of a linear switched system. In addition, the realization 
problem for LSS can be reduced to the problem of finding a suitable repre- 
sentation of the Markov-parameters of the input-output maps. However, in 
practice we can obtain only Markov-parameters only up to some finite order. 
Hence, the partial-realization problem (with respect to the number of switches) 
arises. In order to present a formal problem formulation, the notion of (mini- 
mal) partial realization has to be introduced. 

Definition 18 (iV-partial realization) Let $ be a family of input-output 
maps admitting a generalized kernel representation. Assume that E is a LSS 
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of the form (5) and let ji : $ — >■ X . A realization (S,/i) is said to be an N- 
partial realization of $ 7 if for all q,q Q £ Q, f £ j = 1,2, ... ,m, and for 
all (possibly empty) sequences of discrete modes q 1 , q 2 , . . . , qk £ Q, of length at 
most N , i.e. N > k > 0, the following holds. 

S qm j(qiq2 ■ ■■qk) =C q A qk A qkl ■ ■ ■ A qi B qo ej 
Sf, q (lil2---qk) =C q A qk A qk _ x ■ ■ ■ A qi fi(f) 

The LSS E is said to be an N -partial realization of§, if for there exists a map 
ji : $ — )> X such that the realization is a N -partial realization o/$. The 

realization is said to be a partial realization of $ ; if it is a N -partial 

realization of $ for some N G N. 

From Lemma 1 it follows that (£, fi) is a realization of $ if and only if (E, fi) is 
an iV-partial realization of $ for all N E N. Note that if $ and (E, fi) is a iV- 
partial realization of $, then (19) holds for finitely many Markov-parameters. 



Remark 1 (Terminology: Markov-parameters of order N) In the se- 
quel, we will often refer to the Markov-parameters which are indexed by a 
sequence of discrete modes of length N as Markov-parameters of order N. 
That is, a Markov-parameter of order N is either of the form S qtqo j(w) or 
Sf, q (w) with \w\ = N. 

With this terminology, a LSSis a iV-partial realization of $, if it recreates the 
Markov-parameters of $ of order at most N. If $ is finite, then the set of 
Markov-parameters of $ of order at most N is finite. 

Definition 19 (Minimal iV-partial realization) A LSS realization (E,/x) 
is said to be a minimal N -partial realization of $ ; if (S,/i) is a N -partial 
realization of <3> and for any other N -partial LSS realization (E , (j, ) of $ 7 
dimE < dimE'. 

In plain words, a minimal iV-partial realization of $ is a iV-partial realization 
of $ with the smallest possible state-space dimension. 

Problem 2 (Partial-realization problem) Let& be a family of input- output 
maps admitting a generalized kernel representation. The partial-realization 
problem entails the following problems. 

• Find conditions for existence of a N -partial realization o/$. Formulate an 
algorithm for computing a N -partial realization from generalized Markov- 
parameters of $ of some bounded order. If $ is finite, this amounts to 
computing a N -partial realization from finitely many Markov-parameters. 

• Characterize minimal N -partial realizations of find conditions for their 
existence and uniqueness. 
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• Find conditions under which a N -partial realization of $ is a complete re- 
alization of $ in the sense of Definition 8. 

We will devote the remaining part of the section to presenting the solution to 
the problem formulated above. We will show that it is possible to compute a 
minimal iV-partial realization from a suitably chosen sub-matrix of the Han- 
kel matrix of $. In fact, this sub-matrix is formed by Markov-parameters of 
bounded order, and hence it is finite, if $ is finite. If N is large enough, the 
resulting iV-partial realization will be a complete realization of $. In addition, 
if $ is finite, then this sub-matrix will be finite as well. 



6.2 Finite Hankel-matrices 

In order to state the main results rigorously, we need to define formally, certain 
finite sub-matrices of the Hankel-matrix. To this end, recall from Definition 
17 the definition the Hankel-matrix of <£> and of the sets / = {1, 2, . . . ,pD}, 
D = \Q\ and J$ = $ U (Q x {1, 2, . . . , to}). Fix natural numbers L, M e N. 
We will use the following notation for sequences of discrete modes of length 
at most L. 

Notation 3 We will denote by Q- L the set of all sequences of discrete modes 
of length at most L, i.e. Q- L = {w e Q* \ \w\ < L}. 

Notice that Q- L is a finite set of cardinality (\Q\ L+1 - 1)/(\Q\ - 1). We will de- 
fine the sub-matrix of the Hankel-matrix Hq> indexed by sequences of bounded 
length as follows. 

Definition 20 (i?$,L,M sub-matrices of the Hankel-matrix) Let Hq, >L>M 
be the sub-matrix of H$, formed by the intersections of all the rows indexed by 
an index of the form (v, i) with % e / and v e Q- L and all the columns indexed 
by an index of the form (w,j) with j e J$ and w G Q- M . That is, H$ ;L;M e 

R {Q^xI)x(Q<MxJ*) and f or mch ( v>i ) eI X Q<L and ( w j) e (g<M x 

(H<t, jLjM )(v,i),(w,j) = (#<&) (<;,*),(«, j) 

Notice that the entries oiH§ L M are made up of Markov-parameters of order 
at most K + L. Hence, if $ is finite, then the set J$ is finite and the matrix 
H$,l,m is a finite matrix. Since the entries of the Hankel-matrix were defined 
in terms of the generalized Markov-parameters of the same is true for the 
entries of i?$,jv,M- Writing out the definition of the entries of i?$,L,M and using 
the enumeration of Q fixed in (16) of Definition 17, we get that for any i of 
the form % = Kp + r + 1 where K — 0, 1, . . . , D — 1 and r = 0, 1, . . . ,p — 1, 
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the entry (H$ t L t M)(v,i),(w,j) is defined as follows 
,„ v j (S<r K+1 ,q,z(wv)) r+ i if j = (q, z) E Q x {1 , . . . , m} 

{tt<3>,L,M)(v,i),(w,j) = < l 2U ) 
I (%^ + l(*HWl ifj = /G$ 

Here (5' (Tjf+ligiZ ('UW )) r +i and (S f i(TK+1 (wv )) r +i denote the r + lth element of the 
vectors S aK+liqiZ (wv) G M p and Sf y(TK+1 (wv) G M p respectively. That is, if $ is 
finite, then H^>^,m represents a finite collection of Markov-parameters of <3>. In 
turn the Markov-parameters of <3> are defined via the high-order derivatives of 
the elements of $ with respect to the switching times. Hence, we get that the 
entries of -ff$,L,M are just high-order derivatives with respect to the switching 
times of the elements of $, and the order of these derivatives is bounded by 
L + M. 



6.3 Solution of the partial-realization problem 



As we remarked in the introduction, there are essentially two ways to construct 
a partial LSS realization. In Algorithm 1 we present the first one which is con- 
ceptually the simplest one. Informally, the realization (E N ,p, N ) returned by 
Algorithm 1 is defined on an isomorphic copy of the column space of H^ jN>N+1 . 
The j'th column of the matrix B q is the isomorphic copy of the column of 
H^ t N,N+i indexed by (q,j)- The matrices C q are such that if they are inter- 
preted as linear maps, then each C q maps the isomorphic copy of a column of 
H$ t N,N+i to its first p rows, i.e. rows indexed by (e, 1), (e, 2), . . . , (e,p), where 
e denotes the empty word. The matrices A q realize a shift on the columns of 
H$,n,n+i] by interpreting A q as a map on the columns of H^ >NjN+1 , A q maps 
the column indexed by an index (w,j) G Q- N x J$ to the column indexed 
by (wq,j). Finally, the value /xat(/) of the map /j,n for the input-output map 
/ G $ equals the column of H^ jN>N+1 indexed by (e, /). Notice that Algorithm 
1 may fail to return a realization, as (23) need not always have a solution. 

Remark 2 If Q consists of finitely many input-output maps, then Algorithm 
1 is a indeed an effective procedure and it can be implemented as a numerical 
algorithm. 

We state the partial-realization theorem as a theorem providing sufficient con- 
ditions for Algorithm 1 to yield a 2A"+l-partial, semi-reachable and observable 
realization of $. 

Theorem 3 (Existence of partial realization of LSSs) Let $ be a fam- 
ily of input-output maps admitting a generalized kernel representation. 

• Existence and computability of a partial realization 



27 



Algorithm 1 



1: Assume that n = rank H^ ; n,n+i and let S be a linear isomorphism from 
the column space of H$,n,n+i to R n . For each column index (w,j) G 
q<n+i x d eno -t; e by C w j the column of H<s>^,n+i indexed by (w,j). 

2: Define the LSS realization (T, N ,/j, N ) as follows. 
Sa? is a LSS of the form (5) such that 

• The state-space of Ejy is R n . 

• For each mode q G Q, the matrix £? ? G M nxrrt satisfies 



B q 



5(C €)(?i i)), 5(C e> ( gj2 )), ••• , <S(C ei(<?)m) ) 
• For each mode q <E Q, the matrix C q G R pxn satisfies 



(21) 



C q S(C w j) — 



(-f/*,JV,JV+l)(e,l),(,i;,j), (#<&,iV,jVH-l)( e ,2),(™J), ' ' > (-^*,JV,JV+l) (e,p),(w,j) 



T 



(22) 

for each w G Q- N+1 . 
• For each mode q E Q, the matrix A g G R" xri is the solution of the 
system of linear equations 

S^AgSiCyjj) = C wqd for each w G Q~ N ,j G J$ (23) 

If the system of equations (23) does not have a solution, then abort. 
3: For each input-output map / G <E», 

Mf) = S(C e)/ ) (24) 

4: Return (Ej^jv), with Ejy = {R n ,U,y,Q, {(A q ,B q ,C q ) \ q G Q}). 

Assume that for some N 3 N > 0, 

rank H$ jNjN = rank H^^+^n = rank H^ tNiN+1 (25) 

/ioWs. T/ien Algorithm 1 returns a realization (Y, N ,/i N ), and (E N ,fj, N ) is a 
semi-reachable and observable 2N + 1-partial realization of $. 
• Existence of a complete realization 

//rank Hq> NjN = rank H§, then (25) holds and the 2N + 1-partial realiza- 
tion (Ejv,/ijv) minimal realization o/$. TTie condition rank H$ NjN = 
rank /ioZe£s /or a given iV 7 i/ i/iere exists a LS^ realization (E, //) o/ $ 
swc/i t/ierf dim E < iV + 1 . 

The proof of Theorem 3 is presented in Section 7. Theorem 3 implies that if $ 
has a realization by a LSS , then the parameters of a minimal LSS realization 
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of $ are computable from, and hence are determined by, finitely many time 
derivatives with respect to the switching times. 

Remark 3 (Partial versus complete realization) The numerical exam- 
ple in Section 3 provides an example of a family of input- output maps $ re- 
alizable by a LSS, such that $ has the following property. For some N the 
family $ satisfies (25) but not the condition rank H^^n = rank H<$>. That is, 
there is an N e N such that Theorem 3 yields a partial realization of $ which 
is not a complete realization. That is, the two statements of Theorem 3 indeed 
describe two separate cases. 

Theorem 3 allows us to formulate the following characterization of minimal 
partial LSS realizations. 

Theorem 4 (Minimal partial realization) With the notation of Theorem 
3, if $ satisfies (25), then the following holds. 

(1) A minimal 2N + 1 partial realization of $ exists, in fact, the realization 
(Eat, //at) returned by Algorithm 1 is a minimal 2N + 1 partial realization 
of<$>. 

(2) Any minimal 2N + 1 partial realization of <£> is semi-reachable and ob- 
servable and it is of dimension rank H^ N N . 

(3) All minimal 2N + 1 partial realizations of $ are isomorphic. 

The proof of Theorem 4 is presented in Section 7. 

Remark 4 The reader might wonder why we talk about 2N + 1-partial real- 
izations in Theorem 3 and 4- The reason behind it is that the finite Hankel- 
matrix H$,n,n+i is formed by values of the Markov-parameters of order at 
most 2N + 1. We would like the LSS realization obtained from H^ t N,N+i to 
recreate at least those Markov-parameters which are the entries of the matrix 
H<s>,n,n+i- But this means that the LSS realization obtained from H^ jN;N+1 
must be a 2N + 1-partial realization of $. 

6.4 A Kalman-Ho-like partial-realization algorithm for LSS 

While Algorithm 1 is theoretically attractive because of its simplicity, it is 
not necessarily the most suitable one for numerical implementation. In Algo- 
rithm 2 we present an alternative algorithm for computing a semi-reachable 
and observable 2N + 1-partial realization of <3>. The algorithm is based on fac- 
torization of the Hankel-matrix H$ t x+i,N an d it is similar to the Kalman-Ho 
algorithm. 

The result of Algorithm 2 is described in the theorem below. 
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Algorithm 2 ComputePartialRealization(iJ$ i Ar + i a?) 

1: Compute a decomposition of H$ tN+ljN 



H, 



®,N+1,N 



OR 



where O G RW-^ 1 ^ 1 . 2 .-^})^ and i? G r x W s " XJ *) are matrices such 
that r = rank R = rank O = rank i?$,w+i,Ar- 
2: Recall from (16) the enumeration of Q. For each q <E Q, q = <Ji, for some 
i = 1, 2, . . . , D define the matrix C7 g G R pxr by 



Co 



(e,(i-l)p+l),.> Ly (e,(i-l)p+2),.' • • • ) ^(e.ip),. 



where . denotes the row of O indexed by k. 



3: For each q £ Q, define the matrix B q G 



by 



B„ 



R.,(e,(q,l)), R.,(e,(q,2)), R.,(e,(q,m)) 



where R.,(e,(q,j)) stands for the column of R indexed by (e, (q,j)) for q G Q 
and j — 1, 2, . . . , m. 
4: Define the map fl N : $ — > W as 

V/ G $ : ^(/) = i2., (e>/) 

where R.,( e ,f) stands for the column of R indexed by (e, /) for / G $. 
5: For each q G Q let the matrix A q G M rxr be the solution of equation 



(26) 



where T,T q G ~$l(Q <N x i 1 < 2 >->P D }) xr are matrices of the form 



(v,i),j 



0( v ,i),j (Xq)(v,i),j — 0(qv,i),j 



for all (v,i) G Q-^ x {1, 2, . . . ,pD}, j = l,2,...,r. That is, T is obtained 
from O by deleting all the rows indexed by pairs (v, i) where v is a sequence 
of discrete modes of length N + 1. The matrix T q is the shifted version of 
O, i.e. its row indexed by (v,i) is the row of O indexed by (qv,i). 
6: If there no unique solution to (26) then return NoRealization. Otherwise 
return (E N ,J1 N ) where Sjy is a LSS of the form 



\U,y,{(A q ,B g ,C g )\qeQ}) 
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Theorem 5 (Partial realization algorithm) With the notation above the 
following holds. 

(1) Assume that form some N > 0, (25) of Theorem 3 holds. Then Algo- 
rithm 2 always returns a LSS realization (Ejv,/Jjv) and (Ejv,//jv) is a 
minimal 2N + 1-realization o/$. In fact, (Ejv,/Jjv) is isomorphic to the 
LSS realization (Ejv, /Aw) of Theorem 3. 

(2) If for some N > 0, rank -£/$ ; jv,jv = rank H§, then (25) holds, and Algo- 
rithm 2 returns a minimal realization (Ejv,/Jjv) o/$. If there exists a LSS 
realization (S,/i) of $, such that dimE < N + 1, then rank -f/$ ; jv,jv = 
rank holds and the realization (Ejv,/Jjv) returned by Algorithm 2 is a 
minimal realization of $ . 

The proof of Theorem 5 is presented in Section 7. 

Theorem 5 has immediate consequences for systems identification and model 
reduction of linear switched systems, by guaranteeing correctness of the re- 
alization algorithm described in Algorithm 2. In Section 8 we discuss the 
potential applications of Theorem 3 and Theorem 5 in more detail. 

Remark 5 (Factorization of .f^jv+i.Ar) There are many algorithms to com- 
pute a factorization of H$^+i,n o,nd Algorithm 2 can work with any of these 
algorithms. In particular, the factorization of H<&^+i,n can be done through 
singular value decomposition. More precisely, assume that $ is finite, and let 
H$ t N+i,N — UT,V be the singular value decomposition of H$ N+1N . Notice that 
if <& is finite, then the index sets o/if$ ; jv+i,jv are finite and the singular value 
decomposition can be computed by enumerating the index sets of H$ 7 n+i,n 
and viewing if*,jv+i,jv as a usual finite matrix. The choice O = UY}I 2 and 
R = E 1 / 2 ^ yields a decomposition i?$,jv+i,jv = OR satisfying the conditions of 
Algorithm 2. Algorithm 2 has been implemented, and the factorization used in 
the implementation is precisely the singular value decomposition of H$ tN+ljN . 



7 Proof of the partial realization theorems for LSS 

The goal of this section is to present the proof of Theorem 3, 4, and 5. Both 
proofs rely heavily on the relationship between rational representations and 
LSSs described in [45,42]. The outline of the section is the following. In Subsec- 
tion 7.1 we will present a short overview of the relationship between rational 
representations and LSSs . Subsection 7.2 presents the proof of Theorem 3, 4, 
and 5. The reader is advised to review Appendix A and Appendix B before 
reading this section. Throughout the section we will tacitly use the notation 
and terminology of Appendix A and Appendix B. 
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7. 1 Rational representations and LSSs 



Below we will present a brief overview of the relationship between formal power 
series and their representations and LSSs . For a more detailed presentation 
see [45,42]. Informally, the relationship is as follows. 

• Let $ be a family of input-output maps and assume that $ has a general- 
ized kernel representation. Then we can construct a family of formal power 
series from the generalized Markov-parameters of The details of the 
construction will be presented below. The family will be referred to as 
the family of formal power series associated with $. 

• If (£,//) is a LSS realization of <3>, then we can construct a rational repre- 
sentation i? S)/1 of from the parameters of The converse is also 
true; if R is a representation of then we can construct a realization 

of $ from the parameters of R. The details of the construction 
will be presented below. We will call R-e,^ the representation associated with 
and we will call (S R ,/x R ) the realization associated with R. 

We will start with defining the family of formal power series associated with 
$. To this end, recall from Definition 16 the definition of Markov-parameters 
S qtq0t j(w), Sf tq (w) of $, for each pair of discrete modes q, qo G Q, input-output 
map / G $, index j = 1,2, ...,to, and a word w G Q* . It is easy to see 
that the maps S qm j : Q* 3 w — > S qm j(w) G M p and Sf >q : Q* 3 w — > 
Sf tq (w) G MP define formal power series S qtqo j and Sf >q in MP Q* ^>. Recall 
the enumeration of the set of discrete modes Q defined in (16). That is, Q 
assumed to have D elements given by the distinct elements <7i, a"2, . . . , o~d- For 
each discrete mode q G Q, index j — 1, 2, . . . , to, and input-output map / G $ 
define the formal power series S q j,Sf G R pD <C Q* 3> as follows; for each 
word u> G Q* let 



5 wH 
S>(«0 



(%,») T , (5 /iCT2 Hf,...,(^H) 



oD 

(27) 



That is, the values of the formal power series S q j are obtained by stacking 
up the values of S auq j for i = 1,2, . . . , D. Similarly, the values of Sf are 
obtained by stacking up the values of Sf :(Ti for % — 1, 2, . . . , D. Define the set 
Jq> = $ U {(q,z) | q G Q, z — 1, 2, . . . , to}. Define the indexed set of formal 
power series associated with $ as the following family of formal power series 



= {Sj G M pU < Q* >| j G J$} (28) 



Remark 6 (Equivalence of definitions of the Hankel-matrix) R is easy 
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to see that the Hankel-matrix Hy^ of the family of formal power series is 
identical to the Hankel-matrix of $ as defined in Definition 17 



Next we define the rational representation i? SjU associated with a LSS realiza- 
tion (S,/i). Let E be a LSS of the form (5) and assume that u : $ — > X is a 
map assigning to each element / G $ an initial state of S. 

Construction 1 (Representation associated with a realization) Define 
the representation associated with (S,/i) as the rational pD — J$ representa- 
tion 

Ry,,^ = (X , {A q } qt =Q, B, C) 
The various components of are defined as follows. 

• State-space X. The state-space of the rational representation -Re iM is the 
same as the state-space o/ E, i.e. W 1 = X . 

• Alphabet. The representation i?£ iM is defined over the alphabet which 
equals the set of discrete modes Q. 

• State-transition maps (matrix) {A q G W n } ql =Q. For each discrete mode 
q G Q, the corresponding state-transition matrix A q of Re,h is identical to 
the matrix A q o/E. 

• Readout matrix (map) C G W Dxn . The readout matrix C is obtained 
by vertically "stacking up" the matrices C ai , . . . , C UD in this order from top 
to bottom. That is, the pD x n matrix C is of the form 

c=[c T ai cl,... cl D] 

• Initial states. The indexed set of the initial states of is of the form 

B = {Bj G X | j G J$}, 

i.e. it is indexed by the elements of the index set J$. Its elements are defined 
by Bf = fi(f) if f is an element of the family $ 7 and B( q n = (B q ) j, i.e. 
B( q ^ is the Ith column of B q , for all q G Q and I — 1, 2, . . . m. 

Conversely, below we will construct a LSS realization (E#, ixr) from any ratio- 
nal representation R satisfying some mild conditions. We would like to note 
that these conditions are automatically satisfied by a suitably chosen isomor- 
phic copy of any rational representation of The definition goes as follows. 



Construction 2 (Realization associated with a representation) 

Consider a pD — J$ representation R of the following form 

R=(X,{A q } qeQ ,B,C) (29) 
and assume that the following holds. 
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(1) The state-space is of the form X = R n for some n > 0. Qj 

T/ie readout map C takes its values in W D . 
(3) The set of initial states B of R is indexed by the index set J$ = $U{(g, j) \ 

q G Q,j = 1,2,..., m}. 

Define the LSS realization /i_r) associated with R as follows. Let be 
of the form (5), where 

• State-space. The state-space W 1 o/£# is the same as that of R. 

• System matrices {A q G R nxn } gg Q of For each discrete mode q G Q, 
the matrix A q of is identical to the state-transition matrix A q of R. 

• System matrices {C q G M. pxn } q( zQ of For any discrete state q G Q 
of the form q = o~i for some i = 1, 2, . . . , D and for any I = 1, 2, . . . , p, the 
Ith row of C q equals the p{i — 1) + l-th row of C , i.e. 



C 



System matrices {B q G M nxm } ge g o/Sr. For each discrete mode q G Q 
the n x m matrix B q is obtained as follows; the Ith column of B q equals the 
initial state B q j for all I — 1,2, ... ,m. That is 



B„ 



B(q,l), £( g ,2), 



) -^(q,m) 



The map /in : $ — > X assigns to each element f of $ the initial state of R 
indexed by f, i.e. 

H R (f) = Bf for all f G 



The following theorem states the relationship between representations and 
realizations formally. 

Remark 7 If we apply Construction 1 to (E/j,/^) then the resulting repre- 
sentation Rt, r ,^ r coincides with R, i.e. Rz R ^ R = R- Conversely, if we apply 
Construction 2, to the representation Rt,,^ associated with a LSS realization 
then we get back, i.e. S_r Em = S, I^r s m = 

Theorem 6 ([44,42,45]) Let $ be a family of input-output maps and as- 
sume that $ admits a generalized kernel representation. With the notation 
and assumptions above the following holds. 

• The realization is a LSS realization of $ if and only if Rt,,^ is a 

rational representation o/\I/$. 

2 If X = W 1 does not hold, then replace R with the isomorphic copy SR defined 
in (A. 7), Remark 9 whose state-space is W 1 . Since SR and R are isomorphic, if R 
is a representation of \]/$, then SR will be a representation of as well. 
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• The representation R is a representation of if and only if (£r, [ir) is a 
realization of $ . 

• The realization (£,//) is semi-reachable if and only if Re,h is reachable, and 
(S, A*) is observable if and only if R-z,n is observable. Similarly, (£r, Hr) is 
semi-reachable if and only if R is reachable, and is observable if 
and only if R is observable. 

• //(£, /i) is a minimal realization o/$, i/ien -Rs lM is a minimal representation 
of^$. Conversely, if R is a minimal representation ofty$, then (£#,/zr) 
is a minimal realization of $ . 

• The map S : (£, /i) — )■ (S , fi ) is a LSS morphism if and only if S : Re,^ 
R^> ' is a representation morphism\^\ In addition, S is a representation 
isomorphism if and only if S is a LSS isomorphism. Conversely, if the 
representations R and R satisfy the assumptions of Construction 2, then 
S : R — >■ R is a representation morphism if and only if S : (£r, /j,r) — > 
(E R i,fi R >) is a LSS morphism. Moreover, in this case S is a representation 
isomorphism if and only if it is a LSS isomorphism. 

7.2 Proofs of the results on partial realization of LSSs 

The goal of this section is to present the proof of Theorem 3, Theorem 4 and 
Theorem 5. Before proceeding to the proof, we will need the following result 
on the relationship between the partial-realization problem for LSSs and the 
partial-realization problem for rational representations. 

Theorem 7 Assume that $ is a family of input-output maps admitting a 
generalized kernel representation. A LSS realization (S, /x) is an 2N + l-partial 
realization of $ if and only of the associated representation Re,^ is a 2N + 1- 
partial representation of the family of formal power series Conversely, a 
representation R is a 2N + 1-partial representation of the family of formal 
power series associated with $, if and only if the LSS realization //r) 
associated with R is a 2N + 1-partial realization of $. 



PROOF. The second statement of the theorem follows from the first one by 
noticing that R = Rn R ,n R - Hence, it is enough to prove the first statement 
of the theorem. It follows easily from Definition 18 and the definition of the 
formal power series Sf,S qo j, f G $, go G Q, j = 1,2, ... ,m, that (£,//) is 
an 2N + 1-partial realization of if and only if for all w G Q- 2N+1 , go G Q, 



3 Notice that the state-space of coincides with that of Re,h and the state- 

space of (X , fi ) coincides with that of / . Hence S can indeed be viewed as 
a representation morphism, provided that it commutes with the matrices of the 
representations involved 
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/ e $, j = 1,2,..., m 





-iT 

A w Bq 6j 



(30) 




'T /tT 



From the definition of -Re,^ it follows directly that (30) is equivalent to -Rs )At 
being an 2iV + 1-representation of 

Now we are ready to present the proof of Theorem 3. 

PROOF. [Proof of Theorem 3] First, recall from Definition 29 the definition 
of the matrix H$, 9 k,l for some K, L > 0. It is easy to see that the matrix 
Hq>,K,L as defined in Definition 20 and the matrix H<s> 9) k,l coincide, in par- 
ticular rank H<$> k,l — rank H^,^^,l- Recall from Remark 6 that the matrices 
H$ and H<$, 9 coincide and hence their ranks are equal as well. We will prove 
the two statements of the theorem separately. 

Proof of existence and computability of a partial realization 
We will apply Theorem 10 to ^><&. Using the remark above, the condition 
(25) can be rewritten as (B.2) of Theorem 10. Consider the pD — J$ repre- 
sentation R N = (lmHq, jN;N+1 , {A q } qe Q, B, C) defined in Theorem 10. Since 
(B.2) holds, we get that Rn is well-defined and it is a 2iV + 1-partial repre- 
sentation of and Rn is reachable and observable. Consider Algorithm 
1 and the isomorphism S defined there. It is clear that S maps the col- 
umn space of H-q, it N,N+i = H$,n,n+i to W l with n = rank H^ t N,N+i- Con- 
sider the isomorphic copy SRn of Rn via the morphism S, i.e. SRn = 
(W n , {SAqS'^qeQ, {S(Bj) I j e J$}, CS- 1 ). Obviously, SR N is also a 2N+ 1- 
partial representation of and it is reachable and observable. 

Consider now the LSS realization (S 5Rjv , Hsr n ) associated with SR^, as de- 
fined by Construction 2. 

Notice that if (25) holds, then (23) of Algorithm 1 has a unique solution. 
Indeed, (23) can be rewritten as A q R = R q . Here R,R q E R^W-"*'*) and 
for each (w,j) G Q- N x 7$, the column of R indexed by (w,j) is the image 
<S(Cw,j) of the column of H^^n indexed by (w,j); the column of R q indexed 
by (w,j) is the image S(C wq j) of the column of -ff$,7v,v+i indexed by (wq,j). 
The column space of R is the column space of -ff$,jv,iv by S, hence if (25) holds, 
then rank R = n. That is, the solution of A q R = R q , if it exists, is unique. 

Hence, if (25) holds, then the LSS realization (Ejv,//jv) is uniquely defined 
by (23), (22), (21) and (24), if it exists. It is easy to see that the LSS realiza- 
tion (S 5Rjv , Hsr n ) satisfies (23), (22), (21) and (24) and hence it coincides with 
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(Ejy, //jv)- In other words, the LSS realization (Ejv, //at) returned by Algorithm 
1 equals the LSS realization (£5^, Hsr n ) associated with the isomorphic copy 
SRn of -Rat, i.e. (£ 5RiV , ^srm) = (J^n^n)- By Theorem 7 it follows then that 
(E N ,fi N ) is a 2iV + 1-partial realization of $. In addition, from Theorem 6 it 
follows that (Ejy, /Aw) is semi-reachable and observable. 

Proof of existence of a complete realization 

In addition, from Theorem 10 it follows that if rank H$n,n = rank H$>, i.e. 
rank H^ it N,N = rank Hy^, then Rn is a minimal representation of and 
hence so is SRn- But then by Theorem 6, (Esr n , I^sr n ) = (Eat, /Mr) is a 
minimal realization of Finally, assume that there exists a realization (E, /x) 
of <£>, such that dim E < N. Consider the representation R = Rt,^ associated 
with (S,/i) as defined in Construction 1. Notice that dimi? = dimE < N. 
Then by Theorem 6, R is a representation of Applying Theorem 10 we 
get that rank H^^ = rank H^,^ t N,N and hence rank Hq> = rank H&^n holds, 
and that Rn, and hence SRn, is a minimal representation of Again, 
from Theorem 6 it follows then that (Y, n ,ij n ) = (E^^, fisR N ) is a minimal 
realization of $. 



Next, we will present the proof of Theorem 4 



PROOF. [Proof of Theorem 4] We will prove the statements of Theorem 4 
one by one. Notice that if $ satisfies (25), then the associated family of formal 
power series satisfies (B.2). This allows us to use Theorem 11 and Theorem 
7 to prove the statements of the theorem. 

Proof of Part 1 

From the proof of Theorem 3 it follows that the LSS realization (E N ,fi N ) is 
the realization associated with an isomorphic copy of the representation R N . 
By the remark above the family associated with $ satisfies the condition 
of Theorem 11 and hence by Theorem 11 the representation Rn is a minimal 
2N + 1-partial representation of ^5/<s>. Then Theorem 7 implies that (Ejv, A*at) 
is a iV-partial realization of $ and its dimension is dimit^. It is easy to see 
that (Ejv,A*at) is a minimal 2N + 1-partial realization of $. For assume that 
(E, n) is a 2N + 1-partial realization of <3>. Then by Theorem 7 the associ- 
ated representation is a 2iV + 1-partial representation of ty$>, and hence, 
dim Eat = dimiiV < dimi?s iAt = dimE. 

Proof of Part 2 

If (E,/t) is a minimal 2N + 1-partial realization of $, then from Theorem 7 
it follows that Ry,,h is a minimal 2N + 1-partial representation of Indeed, 
consider a 2iV + 1-partial representation R of ^I/^. Using Remark 9, we can 
replace R by a suitable isomorphic copy, state-space of which is R™ with n = 
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dimi?. This isomorphic copy will also be a 2iV + 1-partial representation of 
Hence, without loss of generality we can assume that the state-space of R 
is M. n for n = dim R and hence Construction 2 can be applied. Then it follows 
from Theorem 7 that fi R ) is a 2N + 1-partial realization of <3>. Hence, 
dimi?s, M = dimS < dimE# = dimi? by minimality of 

But then by Theorem 11 the representation R-E )fl has to be reachable, observ- 
able, and of dimension rank Hq,^^ ^ = rank H^ ^^. By Theorem 6, the latter 
means that (£, ji) is semi-reachable, observable, and of dimension rank H§^ N N . 

Proof of Part 3 

Let and (S,/t) be two minimal 2N + 1-partial realizations of $. Using 

an argument analogous to the one presented in the proof of Part 2, one can 
show that the associated representations i?s u and Ry - are minimal 2N + 1- 
partial representations of the family of formal power series associated 
with <3>. From Theorem 11 it follows that Rt,,^ and Rf. ^ are isomorphic. From 

Theorem 6 it then follows that and (S,/t) are isomorphic. 

We will continue with the proof of Theorem 5 

PROOF. [Proof of Theorem 5] The proof of the theorem relies on the fol- 
lowing observation. Assume that Algorithm 3 is applied to If we use the 
same factorization algorithm for factorizing the matrix ii^^+i^ = ii^jv+i.iv 
in both Algorithm 2 and Algorithm 3, then the following holds. The equation 
(26) has a unique solution if and only if (B.10) has a unique solution. In 
addition, Algorithm 2 returns a LSS realization if and only if Algorithm 3 
applied to H<s> t N+i,N = ii^jv+i.iv returns a rational representation R^, and 
the realization (Stv,/^v) returned by Algorithm 2 is the LSS realization as- 
sociated with the representation R N , as defined in Construction 2. That is, 
(Eat, /Mr) = (£^,/^J. 

Proof of Part 1 

Recall that for all L, M > 0, the matrix Hq> t L,M is identical to -f/^^M, and 
that the Hankel-matrix Hq, is identical to H^^. Hence, by applying Theorem 
12 to s ,jv+i, n we get that if (25) holds, then Algorithm 3 returns a 2iV-|- 1- 
partial representation R^ of Then from the discussion above it follows 
that Algorithm 2 returns a realization (£jv,/ijv) and (T^n^n) = (^r^? ^r n )- 
Then from Theorem 7 it follows that (Eat, J^n) is a 2N + 1-partial realization 
of Moreover, Theorem 12 implies that in this case R^ is isomorphic to the 
representation Rn from Theorem 10. Recall from the proof of Theorem 3 that 
the realization (Sat, /in) from Theorem 3 is the realization associated with the 
isomorphic copy SRn of the representation R^, as defined in Construction 
2. Since Rjy and Rjy are isomorphic, the representations SRn and Rjy are 
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isomorphic as well. Then by Theorem 6 the realizations (Eat, A* at) and (Eat, Hn) 
are isomorphic as well. 

Proof of Part 2 

If rank H$ = rank H^ >N>N , then by Theorem 3, (25) holds and (Ejv,/ijv) is a 
minimal realization of $. Similarly, if there exists a LSS realization (E,/i) of $ 
such that dimE < N, then by Theorem 3, rank H$> = rank H^^,n, and (25) 
holds, and (Ejv, //at) is a minimal realization of $. By the preceding argument, 
in both cases Algorithm 2 returns a realization (E N ,p, N ) and (E^/Ijv) is 
isomorphic to (Ejv, /xjv)- Hence, in both cases the realization (E N ,fi, N ) returned 
by Algorithm 2 is a minimal realization of $. 



8 Discussions and Conclusions 

In this paper we presented basic results on partial-realization theory LSSs. 
We also discussed the algorithmic aspects of the theory and presented two 
algorithms for computing a partial LSS realization. We have also shown that 
under suitable conditions, the obtained partial realizations are in fact (com- 
plete) minimal realizations of the input-output maps at hand. 

The paper represents only the first steps towards partial-realization theory 
of hybrid systems. A number of problems remains open. In particular, we 
would like to extend the results of the current paper to other classes of hy- 
brid systems. Another important research direction is the improvement of the 
computational complexity of the presented algorithms. 

As we mentioned it earlier, the results of the paper can potentially be useful 
for the solution of a number of problems, namely, for system identification, 
model reduction and possibly fault detection and computer vision. Below we 
will discuss these potential application domains in more detail. 

Systems identification algorithms 

As the history of development of system identification demonstrates, partial- 
realization theory can be used to design algorithms for system identification. In 
particular, the famous subspace identification algorithms [11,53] rely heavily 
on partial-realization theory. 

In fact, based on the results of the paper one can derive the following crude 
identification algorithm. Consider a finite family $ = / 2 , . . . , fa} of input- 
output maps. We would like to find a LSS realization of $. Fix a natural 
number N. The number N may be chosen either based on our belief on the 
dimension of a potential LSS realization of $, or our ability/ willingness to 
make measurements of the system responses for switching sequences up to 
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length 2N + 1. 

1: For each / € <&, estimate the first-order time derivatives of -D^ 1 ' 1 ' 1 ''"' 1 ' ^^,^ 

and i5^ 1 ' 1 ' 1 '"'' 1 '°^/e ; ,-,gowg ~~ £^ 1 ' 1 ' 1 '"'' 1 '°^/o,gou>9 f° r an sequences of discrete 
modes w of length at most 2N + 1 and all discrete modes q ,q G Q. Use 
these estimates to construct the Markov-parameters of order up to 2N+1. 
Subsequently, use the thus obtained Markov-parameters to construct the 
Hankel-matrices H$ t N,N, #*,jv+i,jv and i/$ ) jv,jv+i- 
2: If rank H®^^ = rank H$ t N+i,N = rank Hq^n+i, then compute the 2iV+ 
1 partial realization (Ejy-,A*jv) either by Algorithm 1 or Algorithm 2. 

It follows immediately from Theorem 3 and Theorem 5 that if $ can indeed be 
realized by a LSS , then for large enough N the procedure above will return a 
minimal LSS realization of $. In fact, if iV + 1 is the dimension of a potential 
LSS realization of $, then the algorithm above always returns a minimal LSS 
realization of $. 

The algorithm described above has several drawbacks. For example, it is un- 
clear how to obtain in practice the derivatives needed to compute the gener- 
alized Markov-parameters. Another problem that the algorithm does not take 
into account measurement noise. The authors are hopeful that these short- 
coming can be overcome, as similar problems were successfully dealt with in 
the case of linear systems. However, we would like to remark that turning 
partial-realization theory to practically usable identification algorithms might 
take considerable amount of time; in case of linear systems it took several 
decades. 

Spaces of LSSs 

Partial realization theory can potentially be useful for studying the geometry 
and topology of the space of LSSs. In turn, the latter is useful for paramet- 
ric system identification, as it helps to formalize the well-known paradigm of 
system identification: "find a model of a certain class which matches the mea- 
sured data best" . In addition, the geometric insights and the resulting notions 
of distance between systems can be useful for fault detection and computer 
vision [37,8,9,54,46]. For bilinear systems, the first steps were made in [48]. 
For linear systems, the topology and geometry of Hankel-matrices were inves- 
tigated in [16,3,36,25,24,26,27]. Note that spaces of linear systems were also 
studied using a different approach, namely, by looking at equivalence classes 
of minimal linear systems under algebraic similarity. This approach resulted in 
deep results and insights in the geometry of spaces of systems and important 
algorithms for parametric identification, see [23,22,21,19,20,38,4,5]. 

Model reduction 

The presented results on partial-realization theory can also be used for model 
reduction. Recall from [17,18,1] the moment matching approach to model re- 
duction of linear systems. The core of this approach is to approximate a high- 
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order to system with a lower order one, such that the lower order system is 
a a partial realization of a finite number of Markov-parameter of the origi- 
nal system. For LSSs the role of Markov-parameters (or moments) is played 
by the generalized Markov-parameters. Hence, one could try to extend the 
moment matching approach to LSSs and replace a high-order system with a 
lower order one which is a partial-realization of a finite sequence of generalized 
Markov-parameters. Of course, a great number of issues needs to be solved. 
In particular, obtaining good numerical algorithms will be a challenge. 

Extension to more general classes of hybrid systems 

We believe that partial-realization theory for LSSs can be useful for obtaining 
realization theory, system identification and model reduction for piecewise- 
affine hybrid systems with guards. One reason for this is that a piecewise-affine 
hybrid system with guards can be viewed as a feedback interconnection of a 
LSS with an event generator. The former is just the collection of the continu- 
ous subsystems of the piecewise-afnne hybrid system, the latter is constructed 
from the guards of the piecewise-affine hybrid system. Understanding the re- 
alization theory of one of the components of this feedback loop should help 
to understand the realization theory of piecewise-affine hybrid systems. The 
second reason is that LSSs form a subclass of piecewise-affine hybrid systems, 
by identifying each LSSs with a piecewise-affine hybrid system whose guards 
depend only on the choice of inputs. Hence, any result on (partial-) realiza- 
tion theory of piecewise-affine hybrid systems should be consistent with the 
corresponding results for LSSs. 

Acknowledgment The authors thank Pieter Collins and Luc Habets for 
useful discussions and suggestions. 
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A Formal Power Series 

The section recalls basic results on formal power series. The material of this 
section is based on the classical theory of formal power series, see [2,34]. How- 
ever, a number of concepts and results are extensions of the standard ones 
to families of formal power series. The outline of the section is the following. 
In Subsection A.l we present the basic concepts for formal power series. In 
Subsection A. 2 we present the notion of a rational representation of a family 
of formal power series. In Subsection A. 3 we state the main result on existence 
and minimality of a rational representation of a family of formal power series. 

A.l Formal Power Series: Definition and Basic Concepts 

Let X be a finite set, which we will refer to as the alphabet. Recall from 
Section 4.0.3 the notion of a finite word over an alphabet and the related 
notation. A formal power series S with coefficients in IR P is a map 

S : X* ->■ W 

There are many ways to give an intuition for the definition of a formal power 
series. For the purposes of this paper, the most suitable one is to think of a 
formal power series as an output of a machine defined as follows. The machine 
reads symbols belonging to X from its input tape and writes elements of IR P 
onto its output tape. We denote by W <C X* ^> the set of all formal power 
series with coefficients in W. The set of all formal power series over X with 
coefficients in M p forms a vector space with respect to point-wise addition and 
multiplication. That is, if a, G R are two scalars and ^TeffC X* ^> are 
two formal power series, then the linear combination aS + j3T is defined as the 
formal power series assigning each word w G X* the value aS(w) + j3T(w). 

In the sequel we will mostly be interested in families of formal power series. 

Definition 21 (Familly of formal power series) Let J be an arbitrary (pos- 
sibly infinite) set. A family of formal power series in MP <C X* ^> indexed by 
J is simply a collection \& = {Sj 6K P < X* 3>| j G J} of formal power series 
from M p <C X* ^> indexed by elements of J. 

Notice that the definition above does not require Sj, j G J to be all distinct 
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formal power series, i.e. Si = Sj for some indices j, I G J, j 7^ / is allowed. 
One can think of a family of formal power series as a family of input-output 
maps of the machine (discrete-time system) described above, realized from 
a set of initial states indexed by elements of J. We would like to point out 
that the notion of (family of) formal power series is a purely formal one, the 
interpretation of them as input-output maps is just one of the many possible 
interpretations. 

A. 2 Rational Representations and Rational Formal Power Series 

Above we have defined the notion of a formal power series and a family of 
formal power series and we have related these notions to input-output maps 
of some systems. Below we will recall the notion of a rational representation, 
which can be thought of as a special subclass of these systems. We will also 
define morphisms between rational representations along with a notion of ob- 
servability and reachability. 

Definition 22 (Representation) Let J be an arbitrary set and letN 3 p > 
0. A rational representation of type p-J over the alphabet X is a tuple 

R = (X,{A a } aeX ,B,C) (A.l) 

where 

• The space X is a finite dimensional vector space over R, called state-space 
ofR, 

• For each letter a G X , A a : X — >■ X is a linear map, referred to as the 
state-transition map 

• The map C : X — >■ W is a linear map, referred to as the readout map, 

• The family B = {Bj G X \ j G J} is a collection of (not necessarily distinct) 
elements of X indexed by J. 

If p and J are clear from the context, then we will refer to R simply as a 
rational representation. 

The dimension dim X of the state-space is called the dimension of the repre- 
sentation R and it is denoted by dimi?. 

Remark 8 Notice that if a basis of X is fixed and n = dimX, then the 
state-transition maps A a , a G X and the readout map C can be identified 
with their matrix representations in this basis, and for each j G J , Bj can be 
identified with the W 1 column vector of its coordinates. If X = M. n , then we will 
identify the linear maps A a , a G X and C with their matrix representations in 
the standard orthogonal basis ofW 1 . In this case we will call them the state- 
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transition matrices and the readout matrix respectively. 

Definition 23 (Rational family of formal power series) Let* = {Sj G 
W 3 <C X* ^>| j G J} be a family of formal power series indexed by J. The 
representation R from (A.l) is said to be a representation of *, if for each 
index j G J , for all sequences a±, 2 , ■ ■ ■ , o k G X, k > 0, 

Sjfaaz ■■■o- k )= CA^A^ ■ ■ ■ A ai B r (A.2) 

We will say that a family of formal power series * = {Sj eff-C X* j G 
J} is rational, if there exists a representation R such that R is a representation 
of*. 

Notation 4 The following notation will greatly simplify the expressions used 
for rational representations. Let A a : X — > X, o G X be linear maps and let 
w = o~\0~2 ■ ■ ■ °~k G X* , oi, 0- 2 , • • • (Tfc G X, k > 0, be a word over X . Then A w 
denotes the composition of the linear maps A ai , A a , 2 , . . . A ak in that order, that 
is 

A w = A - k A <Tk _ 1 A ai (A.3) 
Ifw — e is the empty words, then A t is taken to be the identity map. 

With the notation above, (A.2) can be rewritten as Sj(w) = CA w Bj for all 
w G X*, j G J. Notice that if X = R n for some N 3 n > 0, and hence 
the state-transition maps A a can be viewed as matrices, then (A.3) defines a 
notation for products of the state-transition matrices taken along the word w. 

Definition 24 (Minimality) A representation R min of* is called minimal, 
if for each representation R of*, dimR min < dimi?, i.e. R m in is a rational 
representation of * with the smallest possible state-space dimension. 

We will continue with presenting the notions of observability and reachability 
for rational representations. Define the subspaces Wr and Or of X by 



W R = Span{A w B j G X \w G X*,j G J} (A.4) 
O r = H ker CA W (A.5) 

w£X* 

That is, the subspace Wr is the linear span of the elements of the state-space 
of the form A w Bj, where w runs through all words over X and j runs through 
all the indices in J. The space Or is the intersection of the null-spaces (kernels) 
of all the linear maps CA W : X — > R p , where w runs through the set of all 
words X*. 

Definition 25 (Reachability) We will say that the representation R is reach- 
able if dim Wr = dim_R. The subspace Wr will be referred to as the reacha- 
bility subspace of R. 
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Definition 26 (Observability) We will say that R is observable if Or = 
{0}, i.e. Or consists of the zero element only. The subspace Or will be referred 
to as the observability subspace of R. 

Next, we define the notion of morphism between rational representations. 

Definition 27 (Representation morphism) Let R = (X, {A a } aeX , B, C), 
R = (X, {A a } aeX , B, C) be two p-J rational representations. A linear map 
S : X — >■ X is called a representation morphism from R to R and is denoted 
by S : R — )■ R if S commutes with A a , Bj and C for all j G J , o G X , that 
is, if the following equalities hold 

SA a = A a S, V<7 G X, SB 3 = Bj,Vj G J, C = CS (A.6) 

The representation morphism S is called surjective, injective, isomorphism if 
S is a surjective, injective or isomorphism respectively if viewed as a linear 
map. 

Remark 9 If R = (X, {A a } ae x, B, C) is a representation of^f, then for any 
vector space isomorphism S : X — > M, n , n = dimi? ; the tuple 

SR = (R n , {SAvS-^extSB, CS' 1 ) (A.7) 

where SB = {SBj el B | j G J} is also a representation of ^ . Moreover, for 
all a G X , SAfjS' 1 can be naturally viewed as a matrix by taking its matrix 
representation with respect to the natural basis ofW 1 . Similarly, by taking ma- 
trix and vector representations of CS^ 1 and SBj, j £ J in the natural basis 
ofR n , we can view CS^ 1 and SBj, j&Jasapxn matrix and n x 1 vector 
respectively. Moreover, S : R — » SR is a representation isomorphism. That 
is, we can always replace a representation of \& with an isomorphic represen- 
tation, state-space of which is M, n for some n, and the parameters of which 
are matrices and real vectors, as opposed to linear maps and elements of ab- 
stract vector spaces. Moreover, isomorphisms clearly preserve such properties 
as observability, reachability, and minimality. 

A. 3 Existence and Minimality of Rational Representations: Main Results 

The purpose of the section is to state the main results on existence and mini- 
mality of representations of families of rational formal power series. 

We will start by stating the main result on the existence of a rational rep- 
resentation. However, in order to state the main theorem, we need to de- 
fine the concept of the Hankel matrix of a family of formal power series. Let 
^ = {Sj X* j G J} be a family of formal power series. Recall the 

notation of Section 4.0.2. 
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Construction 3 (Hankel-matrix) Define the Hankel-matrix of \& as the 

following infinite matrix Hq,. The rows of Hq, are indexed by pairs (v,i) where 
v G X* is an arbitrary word and i = 1, . . . ,p. The columns of Hq, are indexed 
by pairs (w,j) where w G X* is a word over X and j G J runs through 
the elements of J. The set of all real infinite matrices with the indexing the 
rows and columns described above will be denoted by ]R( x * x/ ) x ( x * xJ ) where 
I = Hence, we can write Hq, G M. ( - X * xl ^ x( - X * xj \ The entry of Hq, 

lying on the intersection of the row indexed by (v, i) and the column indexed 
by (w,j) is defined as 



where (Sj(wv))i denotes the ith entry of the column vector Sj(wv) G W. 

According to the convention adopted in Section 4.0.2, we define the rank of 
Hq,, denoted by rank Hq,, as the dimension of the vector space spanned by the 
columns of Hq,. With the notation above the following theorem holds. 

Theorem 8 (Existence of a representation, [44,42]) The family^ is ra- 
tional, i.e. admits a rational representation, if and only if rank Hq, < +oo, 
i.e. the rank of the Hankel-matrix Hq, is finite. 

The proof of the above theorem can be found in [44,42]. 

As the next step we will present below the main result on minimality of rational 
representations. 

Theorem 9 (Minimal representation, [44,42]) Let * = {Sj G W < 

X* j G J} be a family of formal power series. The following are equivalent. 

(i) R m in is a* minimal representation of ^ . 

(ii) Rmin is reachable and observable. 

(Hi) If R is a reachable representation of '\& , then there exists a surjective repre- 
sentation morphism S : it! — > R m in- 
(iv) rank Hq, = dimR min . 

In addition, all minimal representations of \& are isomorphic. 

The proof of the above theorem can be found in [44,42]. 

Remark 10 (Related work) The counterpart of the above two theorems, 
i.e. Theorem 8 and 9, for a single formal power series is a classical result, see 





[12,2,34,49,50]. 
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B Partial-realization theory of formal power series 

Below we will formulate and solve the counterpart of the partial realization 
problem for families of formal power series. We use the obtained result to 
derive a solution to the partial realization problem for linear switched systems 
formulated in Problem 2. We start with defining the notion of a iV-partial 
representation. 

Definition 28 (iV-partial representation) Let * = {Sj G P « I* »| 

j £ J} be a family of formal power series indexed by J. A J-p representation 
R = (X, {A x } xeX , C, B), with B = {Bj G X \ j G J} is said to be an N- 
partial representation of \I/ if for each index j G J and each word w G X* of 
length at most N , i.e. \w\ < N , 

Sj(w) = CA w Bj 

That is, if R is a iV-partial representation of then R recreates the values 
of the elements of \I/ for all the words of length at most N. Now we are ready 
to formulate the partial-realization problem for formal power series. 

Problem 3 (Partial-realization problem for formal power series) Let 

^ be a family of formal power series indexed by J. 

• Find conditions for existence of a N -partial representation of^/. Formulate 
an algorithm for computing a N -partial representation of 

• Characterize minimal N -partial representations of \& 7 their existence and 
uniqueness. 

• Find conditions under which the N -partial representation above becomes a 
rational representation of ^ in the sense of Definition 22. 

We will devote the rest of the section to solving the problem described above. 
The outline of the section is the following. Subsection B.l presents the main 
results on partial-realization theory of families of formal power series, along 
with a Kalman-Ho-like algorithm for computing a minimal partial representa- 
tion. Subsection B.2 presents the proof of the results presented in Subsection 
B.l. Throughout the section, \1/ will denote the family of formal power series 

• = {Sj Gi p «r >| j g j}. 

B.l Main results on partial-realization theory of formal power series 

The goal of the section is to present the main results on partial-realization 
theory of families of formal power series. Note that partial-realization theory 
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of a single formal power series is more or less equivalent to partial-realization 
theory of bilinear and state-amne systems. The latter was already investi- 
gated in [29,15,31,49,50,11,7]. The results to be presented below represent 
an extension of the results of [29,15,31,49,50,11,7]. In addition, here we state 
partial-realization theory directly for formal power series. This is in contrast to 
[29,15,31,49,50,11,7], where the partial-realization problem and solution were 
stated in terms of state-amne or bilinear systems. 

The outline of the section is the following. Subsection B.l.l presents the state- 
ment of the main results on partial realization theory for formal power series. 
Subsection B.1.2 presents an algorithm for computing a partial representa- 
tion, which is similar to the well-known Kalman-Ho algorithm for the linear 

systems. 



B.l.l Partial realization theory 

We will start with defining the upper-left block matrix of which is indexed 
by words over X of finite length. More precisely, fix natural numbers M, K > 
and define the following sets 

Im = {{v, i) | v G X\ \v\ < M, % = 1, . . . ,p} 

Jk = {(w,j)\ 3 eJ,weX*,\w\<K} [ 

Intuitively, Im is a subset of the row indices of the Hankel-matrix H^, made 
up of indices of the form (v,i), where v runs through all the words of length 
at most M. Similarly, the set Jk is a subset of column indices of H^, made 
up of column indices (w,j) where w runs through all the words of length at 
most K. The set Jk is the set of columns indices of the matrix to be defined, 
and Im is the set of row indices of this matrix. 

Definition 29 (Sub-matrices of the Hankel-matrix H^) Define the ma- 
trix Hy tM ,K e R ImxJk by 

(Hsf,M,K)(v,i),(w,j) = (Hv)( v ,i),(w,j) = (Sj(wv))i 

for all (v,i) G Im and (w,j) G Jk- 

That is, H^ m,k is the left upper I M x Jk block matrix of H^. Notice that if 
J is finite, then | J K \ < +oo, that is, H^ m ,k is a finite matrix. 

It turns out that under certain circumstances partial representations not only 
exist but they also yield a minimal representation of the whole family of formal 
power series. Moreover, such partial representations can be constructed from 
finite data. 
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Theorem 10 (Existence of partial representation) With the notation above 
the following holds. 

(1) If for some N > 0, 

rank H %NjN = rank H^ tNtN+ i = rank H %N+ljN (B.2) 

then there exists a 2N + 1-partial representation Rn of^f, of the form 

R N = (Imf/tf^jv+i, {A a } aeX , B, C) (B.3) 

where the parameters of Rn are defined as follows. For each wordw G X* , 
\w\ < N + l, and each index j G J denote by (H-q, t N,N+i).,( w ,j) the column 
of Hy t x,N+i indexed by (w,j). With this notation, 

• lmH<q,N,N+i denotes the linear space spanned by the columns of Hy tN ,N+i- 

• For each a G X , the linear map A a : ImiJ^^iv+i ~~ ^ I m -f^*,Ar,iv+i has 
the property that for each word w G X* , \w\ < N and each index j G J 

A a ((H-q,^ N ,N+l).,{w,j)) — (HiSi t N,N+l).,(wa,j) (B-4) 

i.e. A a maps the column indexed by (w,j) G Jn to the column indexed 
by (wcrj). 

• For each w G X* , \w\ < N + l and j G J , the linear map C : 
Imi^jv.jv+i - > ^ p satisfies, 

C((H^ :N ,N+i).,( w ,j)) = 

-iT 

(Hy,N,N+l)(e,l),(w,j), (HiB,N,N+l)(e,2),(w,j), , (H^ 7 N,N+l)(e,p),(w,j) 

(B.5) 

That is, C maps each column to the vector in W formed by the first p 
entries associated with v = e of the column indexed by (w,j). 

• The set B = {Bj G lmHy N ,N+i \ j G J} is defined by 

Bj = (#vj/,jv,at+i).,(6j) for all j G J (B.6) 

i.e. Bj is simply the column of H^^n+i indexed by (e,j). 
In addition, the representation Rn is reachable and observable. 

(2) If for some N > 

rank H^^n,n = rank (B.7) 

then (B.2) holds and the representation Rn from (B.3) is a minimal 
representation of ^> . 

(3) If R is a representation of \& , dimi? < N + 1, then (B.7) holds for N 
and the representation Rn from (B.3) exists and it is a minimal repre- 
sentation of . 
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The proof of Theorem 10 is presented in Appendix B.2. Using the results of 
Theorem 10 we can state the following characterization of minimal partial 
representations of *. 

Theorem 11 (Minimal partial representation) With the notation of The- 
orem 10, if* satisfies (B.2), then the following holds. 

(1) A minimal 2N + 1 partial representation of * exists, in fact, the repre- 
sentation Rn of Theorem 10 is a minimal 2N + 1 partial representation 
of*. 

(2) Any minimal 2N + 1 partial representation of* is reachable and observ- 
able and it is of dimension rank H^^^. 

(3) All minimal 2N + 1 partial representations of * are isomorphic. 

The proof of Theorem 11 is presented in Appendix B.2. 

Remark 11 The reader might wonder why we speak of 2N + 1 -partial rep- 
resentations in Theorem 10 and 11. The reason behind it is that the finite 
Hankel-matrix i^^^^+i is formed by values of the formal power series from 
* for words of length at most 2N + 1. We would like the representation ob- 
tained from Hy NjN+1 to recreate at least the entries of the matrix H^ jNN+ i. 
But this means precisely that the representation obtained from H^ tNjN+1 should 
be a 2N + 1-partial representation of * . 

B.1.2 Partial realization algorithm 

In this section we present an algorithm, described in Algorithm 3, which com- 
putes a partial representation by factorizing the Hankel-matrix. The technique 
of Hankel-matrix factorization has been used in realization theory and systems 
identification for several decades. It forms the theoretical basis of algorithms 
for subspace identification, see for example [11,7]. Throughout the section, \1> 
stands for the family of formal power series * = {Sj G W p <C X* 3>| j E J}. 

Algorithm 3 represents an algorithm based on matrix factorization of H^ t N+i,N- 
In addition, if N is large enough, Algorithm 3 in fact yields a minimal rep- 
resentation of *. Algorithm 3 above may return two different types of data. 
It returns a rational representation if (B.10) has a unique solution, and the 
symbol NoRepresentation otherwise. 

Remark 12 (Implementation of the matrix factorization) In step 1 of 
Algorithm 3 above one can use any algorithm for computing a factorization. 
For example, one could use SVD decomposition, in which case i?^,jv+i,jv — 
UTiV T , and O = U{T}/ 2 ), R = (Y}I 2 )V T is a valid choice for decomposition. 

The following theorem characterizes the outcome Algorithm 3. 
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Algorithm 3 ComputePartialRepresentation(iJ^ ,Ar+i,jv) 
1: Compute a decomposition of Hy jN +i,N 

OR 

O G R /jv + lXr , R G W xJn , rank R = rank O = rank Hy jN +i,N = r 
2: Define the matrix C G IR pxr by 



C 



tl T tl T ■ ■ ■ (1 T 

^(€,1),' ^M),.' ^(e,p),. 



(B.8) 



where O^. denotes the row of O indexed by k G ijv+i- 
3: Define the family of vectors B = {Bj G W | j G J}, where for each j G J, 

#j = R.,(e,j) (B.9) 

where R.uj) stands for the column of i? indexed by (e, j). 
4: For each a G X let A a G M rxr be the solution of 

rl^r, (b.io) 

where f, T a G M /ivXr are matrices of the form 

F( u ,i),j = 0(u,i),j an d (rcr)(w,i)J = 0(au,i),j 

for all (u, i) G /at, j = 1, 2, . . . , r. 
5: If there no unique solution to (B.IO) then return NoRepresentation. Oth- 
erwise return 

Rn = (K r , {A x } x£ x, B, C) 



Theorem 12 Let * = {Sj e ff « I* »| j G J} k o family of formal 
power series. With the notation above the following holds. 

(1) Assume that form some N > 

rank H-q,^,N+i = rank H^^+i,n = rank H-q, t N,N (B-H) 

Then Algorithm 3 always returns a formal power series representation 
Rn and Rn is an 2N + 1-partial representation of \& . In fact, the repre- 
sentation Rn from Theorem 10 and Rn are isomorphic. Hence, Rn is a 
minimal 2N +l-partial representation of^/, it is reachable and observable. 

(2) If for some N > 0, 

rank Hi£ t N,N = rank (B.12) 
then (B.ll) holds, and Algorithm 3 returns a minimal representation Rn 
of*. 

(3) Assume rank H^, < N + 1, or , equivalently, there exists a representation 
R of \& , such that dim/? < N + 1. Then (B.12) holds and the represen- 
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tation Rn returned by Algorithm 3 is a minimal representation of ^ . 

Remark 13 (Solution to (B.10)) Although solution to (B.10) need not al- 
ways exist, one can always take a matrix A a , a G X as a solution to the 
minimization problem min^ \\FA a — r a \\ 2 . Then A a can be obtained using 
standard numerical techniques for solving approximation problems. With this 
modification, the algorithm can be applied even if (B.ll) fails. However, the 
representation returned by the modified algorithm need not be a 2N + 1-partial 
representation in this case. 

B.2 Proof of the partial-realization results for formal power series 

The goal of the section is to present the proof of Theorem 10, 11 and 12. In 
Subsection B.2.1 we will introduce some notation and state some preliminary 
results, which will be needed for the proof of the theorems. In Subsection B.2. 2 
we will present the proof of Theorem 10-12. Finally, in Subsection B.2. 3 we 
will present the proof of the technical results which are used for the proof of 
the Theorems 10-12. 

B.2.1 Auxiliary definitions and results 

To begin with, for the purposes of partial realization theory, we need to recall 
some basic steps of the proof of Theorem 8. To this end, we have to introduce 
additional notation and terminology. Let w G X* be a word over X* and let 
5 1 G F < X* > be a formal power series. Define the formal power series 
tuoSGfCl* >, called the left shift of S by w, as follows; we require that 
for all v E X* the value of w o S at v is as follows 

(woS)(v) = S(wv) (B.13) 

i.e. the value of w o S at v equals the value of S at wv. Notice that for any 
word w G X* of the form w = a±a 2 ■ ■ • o~k, 0\,(Ji ■ ■ ■ ,o~k G X and for any 
formal power series TgR p <C X* ^>, the following equality holds 

w o T = a k o (<r fc _i o (• • • (<ti o T) • • • ))) (B.14) 

Moreover, notice that the shift operation is linear, that is, for any T,S G 
W p X* 3>, and for any scalars a, j3 G R, and for any word w G X*, 
wo(aS + (3T) = a{w o S) + (3(w o T). 

Definition 30 (Smallest shift invariant space) Let $ = {5j £ l p < 

X* j G J} be a family of formal power series. Define the the smallest 
shift invariant linear space containing all the elements of ^, denoted by Wq,, 
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as the following subspace of MP <C X* 



Wq, = Span{u> o Sj eW <^x* ->\ j e J,w e X*} 



(B.15) 



That is, Wq, is composed of the linear combinations of all formal power series 
of the form w o Sj for some j G J and w G X* . 

Remark 14 It is easy to see that there is one-to-one correspondence between 
the formal power series w o Sj and the column of Hq, indexed by (w, j) for any 
word w G X* and index j G J. In particular, it follows that Wq, is isomorphic 
to the span of columns of Hq, and hence 



The statement of Theorem 8 follows from the following two auxiliary state- 
ments, proofs of which can be found in [44,42] 

Lemma 2 ([44,42]) Assume that divaWq, < +oo holds. Then a representa- 
tion Rq, of \& is given by 



where for each letter a G X , the map A a : Wq, — > Wq, is defined as the shift by 
a, i.e. for each T G Wq,, A a (T) = ooT ; the collection B = {Bj G Wq, \ j G J} 
is formed by the elements of ^ , that is Bj = Sj for each j G J; the linear map 
C : Wq, — > W is defined as the evaluation at the empty word, i.e. C(T) = T(e) 
for allT G Wq,. 

Lemma 3 ([44,42]) If is rational, then dimW 7 * < +oo. More precisely, 
for each representation R of \& , dimWq, < dimi?. 

Using the lemmas above the proof of Theorem 8 becomes trivial. For the 
sake of completeness we present it below. From Remark 14 it follows that 
dimW 7 ^ = rank Hq,. If rank Hq, < +oo, then Lemma 2 implies that Rq, is 
a well-defined representation of hence \& is rational. Conversely, if \& is 
rational then Lemma 3 implies that dimlL^j, = rank Hq, < +oo. 

Remark 15 The representation Rq, defined in (B.16) is called free. 

Below we will define various spaces which play a role analogous to Wq,. How- 
ever, before proceeding further, some additional notation needs to be set up. 

Notation 5 (Words of length at most N) We will denote the set of all 

words over X of length at most N by X^ N , i.e. X^ N = {w G X* | \w\ < N}. 



dim Wx 



rank 



Rq ! = (Wq f ,{A a } aeX ,B,C) 



(B.16) 
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We will start with defining the space W / $ ri jv of formal power series which 
corresponds to the subset of columns of Hq, indexed by indices of the form 
(w,j) with j G J and w G X- N . Define the vector space W^^n as the sub- 
space of Wq, spanned by all formal power series of the form w o Sj with j G J 
and w G X^ N , i.e. 

W*„ N = Span{(w o Sj) \j eJ,we X* , \w\ < N} (B.17) 

Notice that if J is finite, then Wy^N is generated by finitely many elements. 
The motivation for considering the space W^^n is revealed by the following 
lemma and its corollary. 

Lemma 4 Assume that rank Hq, < N . For any formal power series T G Wq, 
and any word w G X* , there exists scalars a w>v G R ; and words v G X-^ 1 
such that 

woT= a w , v (voT). 

That is, the shift of T by w is a linear combination of the shifts of T by words 
of the length at most N — 1. 

The proof of Lemma 4 will be presented in Subsection B.2.3. 
Corollary 1 //rank Hq, < N + 1, then W 9 = W 9> . >N . 

The proof of Corollary 1 will be presented in Subsection B.2.3. In other words, 
if N is big enough, then H^ vj .jv generates the whole space W-q,. Alternatively, 
stated in the language of Hankel-matrices, the columns of Hq, indexed by 
words of length at most N span the whole image of Hq,. 

Although Wisi,.,N is generated by finitely many elements, if J is finite, its 
elements are formal power series which contain infinite amount of data. In 
order to replace Wq,^^ by a space generated by finite vectors, we proceed as 
follows. We will define a linear space Wq, t M,N which plays a similar role for 
H^,m,n as does f° r Hy. Similarly to Wq,, the elements of Wq, : M,N are 
maps mapping words over X to vectors in R p . However, in contrast to Wq,, 
the elements of Wq, t M,N will be defined only on the words of length at most 
M. Similarly to Wq,, the space W^^m,n w iH allow us to prove our main results 
for the finite Hankel sub-matrices Hq, t M,N in a more intuitive way. 

To this end, we will introduce notation for restrictions of formal power series 
R p < X* > to words of length at most M. 

Notation 6 (Restriction of formal power series to X- M ) Denote byW p <C 
X^ M > the set of functions T : X^ M ->■ R p . 

It is clear that R p <C X- M ^> forms a vector space with point-wise addition 
and point-wise multiplication by scalar. 
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As the next step, define the map 

t]m : R p < X* >^ R p < X^ M > 

projecting any formal power series to its restriction to X- M , i.e. for each 
formal power series TeR p <r >, 

t]m(T)(w) = T(w) for all w G X- M (B.18) 

It is easy to see that i] M is a surjective linear map. Define the vector space 
W^ )N)M by 

W %m ,n = Span{77 M (™ ° 5,) | w G j G J} (B.19) 

It is easy to see that W^ t M,N is the image of W^^n by i] M . The relationship 
between W^^m,n and Hq, jMN can be best described as follows; the generator 
set of W^^m,n is formed by the columns of Hq, jMN . More precisely, recall from 
Subsection 4.0.2 that R /m denotes the vector space of all maps from Im to R. 
Define the map ip M ■ R p < >^ R /m by 

(V>m(T)) m = (T(v))i for all (u, i) G J M (B.20) 

where (T(t> )), stands for the ith entry of the vector T(-u) G R p . It is easy to 
see that ipM is a linear isomorphism. Moreover, Ihl£/^ iMi tv = ^MiW^,M,N)- I n 
particular, 

Corollary 2 dim VT^, )MiA r = rank Hq, MjN . 

Finally, we will show that if \1/ is rational and M is big enough, then the 
restriction of ipM to Wq, is injective. It means then that W^ t M,N and W^ jmj N 
are isomorphic for all N, if M is large enough. 

Lemma 5 Assume that rank Hq, = dim < M + 1. TTien the restriction 
°f Vm to Wq, is injective. 

B.2.2 Proof of the main results on partial realization theory of formal power 
series 

Now we are ready to present the proof of Theorem 10 - 12. 
PROOF. [Proof of Theorem 10] 

Proof of Part 1 We will define a 2N + 1 partial representation Rn of 
^ on the linear space W^^,n- The representation Rn will be reachable and 
observable and will satisfy a number of properties. 
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To this end, define the map 

VN+hN : R p < X^ +1 >^ W < X^ N > 

by requiring that for all S EW P <^ X- N+1 3> and for each word w G X- N , 

Vn+i,n(S)H = S(w) (B.21) 

It is easy to see t)n+i,n is a surjective linear map. Moreover, t]n+i,n maps the 
sub-space W-q, t N+i,N onto Wy^N. Using Corollary 2, (B.2) can be rewritten 

as 

dim W* iNiN = dim Wy iN+ljN = dim Wy jNiN+1 (B.22) 

But (B.22) implies that W^j^jv+i — W#,jv,jv and that the restriction of i] N+ i jN 
to W^jv.iv is injective. The latter means that the inverse map 

VN+l,N : Wy, t N,N — > Wy t N+l,N (B.23) 

exists. 

Define now the representation R N as follows. 

R N = (Wnf ,N,N+i,{A a } ae x,B,C) (B.24) 

where 

• For each letter a G X , the linear map 

is defined as follows. Consider the map T a : Wis t N+i,N — )■ M p X- N ^> 
defined by T a (Z)(v) = Z(av), for all v G X^ N and Z G W 9 , N+liN . It is 
easy to see that T CT is a linear map. In addition, if Z = r)N+i(woSj) for some 
w G X- N , j G J, then T a {Z) = t)n(w(J o Sj). Indeed, for any v G X- N , 
T a (Z)(v) = woSj(av) = Sj(wav) = waoSj(v). Hence, because of linearity 
of T a , the range of T a is W^^jv+i an d thus T a can be viewed as a map 
T a : W^^+i.jv — > W*,jv,jv+i- 

Then for all 5 G W^jv.jv = M^^.jv+i, define 

4,(5) = r ff (i^ liJV (s)) 

It is easy to see that A a is a well-defined linear map and it has the property 
that it commutes with the shift by a of a formal power series in W^^n, i.e. 
for all S G Wy^N, 

A ff (r iN (S))=r iN (aoS) (B.25) 

Here, we used the notation of (B.17). Indeed, notice that n(Vn(S)) = 
t)n+i(S). In addition, from the discussion above it follows that T a (r)N+i(S)) = 
rj N (a o S). Combining this with the definition of A a yields (B.25). 
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In addition, for all S G Wy^N, and for all v G X- N 1 and a G X, 
{A a (S)}(v) = {T a (Tij? +ltN (S)mv) = iVN+iMWi™) = 5 M (B-26) 

• The family = {Bj G W^^jv I j G J} is defined as 

Bj = m(Sj) for all j G J (B.27) 

i.e. -Bj is just the restriction of Sj to X- N . 

• The map C* is defined as 

C : W* )JV ,jv+i S{e) G K p (B.28) 

i.e. C is just the evaluation of the elements of W^ t N+i,N at the empty word 
e. 

It is easy to see that Rn is a well-defined rational representation. Next, we 
will show that Rjy is an 2N + 1-partial representation of To this end, by 
repeated application of (B.25) and using the equality W^j^jv = W^^^+i wc 
get that for any word w G X- N+1 it holds that 

A w Bj = A w (r) N (Sj)) = r) N (w o Sj) (B.29) 

Using (B.29) and repeatedly applying (B.26) yields that for each v G X- N , 

CA v A w Bj = {A v r) N (w o Sj)}(e) = {w o Sj}(v) = Sj(wv) (B.30) 

By noticing that A V A W = A wv and that any word w in X- 2JV+1 can be rep- 
resented as the concatenation w = wv of a word w G X- JV+1 with a word 
v G X^ N , we get that for all w G X^ 2Af+1 , 

That is, i^y is an 2N + 1-partial representation of From (B.29) it is easy to 
deduce that R N is reachable. Observability of R N can be derived as follows. 
From (B.26) it follows that for any T G W 9 , NiN+ i = W 9 , NiN , CA W T = T(w) 
for all words w G X^ N . Hence, if CA W {T) = for all w G X*, then for all 
w G X^ N , T{w) = 0, i.e. T = 0. That is, An = {0}. 

Define now the representation R^ as the isomorphic copy oi Rn with the 
isomorphism ^ defined in (B.20). That is, using the notation of the theorem, 
define the parameters A a , a G X, C and {-&,• | j G J} of i? w as follows. 

A CT = i/jnA^^ 1 for all <r G X, C — Cip^ 1 , Bj = ip N {Bj) for all j G J. 

(B.31) 

Since ^ defines a representation isomorphism : R^ — >■ -Rjv, it follows 
that .R/V is a 2iV + 1-rational representation of \1/ as well, and it is reachable 
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and observable. In addition, combining (B.31) with (B.25,B.27,B.28) yields 
(B.4,B.5,B.6). 



Proof of Part 2 

Recall that rank H^ :M>K = dimW-q, jM ,K for all K,M G {N,N + 1} and 
dimW 7 * = rank Hy. It is easy to see that 

dim Wv tNtN < dim W^ tKM < dim W^, for all K, M G {N, N + 1} (B.32) 

Hence, (B.7) implies that 

dim W 9 ,n,n = dim Wy M ,K = dim Wq, for all M,K e {N,N +1} 

Hence, (B.2) holds and by Part 1 the representation R N is well-defined and it is 
a 2iV+l-partial representation of Recall that (B.2) implies that Wy^N+i = 

It is left to show that Rn is a representation of \&. To this end, recall from 
(B.18) the definition of the map rj N and recall from (B.17) the definition of 

the set Wy : . : N- Since 77jv(W*,.,jv) = Wy^N, we get that 

dimWtf > dimW^^jv > dim W^^.jv = dimW*, 

hence H^ vj jv = W^. Notice that tjn maps W^^.^k onto W^jv.k for K G 
{iV, iV + 1}. Hence, the map ?7tv maps W 7 * onto W^jv.jv = W^j^at+i- That is, 
the restriction r] N to W 7 * yields a linear isomorphism i] N \ W9 : W 7 ^ — > W^, t N,N+i- 
Recall from Lemma 2 the definition of the free representation R$, of ^1, de- 
fined on the space Wq,. Using (B.25),(B.27) and (B.28), it is easy to see 
that the restriction of 77 at to W-q, in fact yields a representation isomorphism 
Vn\w 9 '■ -R* — * Rn, where Rn is as defined in (B.24). Since, Rq is a minimal 
representation of then so is its isomorphic copy Rn. Since Rn is merely an 
isomorphic copy of Rn, the same conclusion holds for R N . 

Proof of Part 3 

Recall from (B.17) and (B.19) the definition of the spaces H 7 ^,.,^ and Wy t M,N 
for M G N, M > 0. From Lemma 3 it follows that if dimi? < JV + 1, then 
dimW 7 * = rank < dimR < N + 1. Then by Corollary 1, W*,.^ = W 7 * 
for K — N, N + 1. Moreover, notice that the image of W^^jv by i]m equals 
W*,Af,Jv- Applying the above observations for M,K e {N, N + 1} and using 
Lemma 5 we get that 77^ and 77^+1 are injective and hence 

dim = dim W^^ K = dim W^m.k- where K, M = N, N + 1 (B.33) 

Combining (B.33) with Corollary 2 and Remark 14 we get that (B.7) holds. 
Combining this and Part 2 we get that the statement of Part 3 holds. 

Next, we will present the proof of Theorem 11. 
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PROOF. [Proof of Theorem 11] We will prove the statements of the theo- 
rem one by one. However, before proceeding to the actual proof, we need to 
introduce some notation. Let R be p-J rational representation, i.e. R is of the 
form (A.l). Define the family of formal power series ^r associated with the 
representation R as follows. The family ^/r is indexed by elements of J, i.e. it 
can be written as V R = {Sf £ l p « I' >| j G J}. For each index j G J, 
and for each word w G X*, the value of the formal power series at w is 
defined as S^w) = CA w Bj. It is easy to see that the rational representation 
R is a representation of ^r, i.e. ^/r is rational. In fact, R is a representation 
of the family of formal power series if and only if ^ R = i.e. Sj = 
for all j G J. Furthermore, R is a 2N + 1-partial representation of VP, if and 
only if Sf(w) = Sj(w) for all j G J and w G X- 2N+1 . In addition, if R is a 
2N + 1-partial representation of then 

H*,k,l = for all K + L < 2N + 1 (B.34) 

In particular, Hy t N,N = H^ r ,n,n- Now we are ready to proceed to the actual 
proof of the theorem. 

Proof of Part 1 

Assume that R is a 2N + 1-partial representation of \F From the discus- 
sion above it follows that rank H^, ;N>N = rank H^ r N ^ n . It is easy to see 
that rank Hq, R:N>N < rank Hy R . Moreover, from Theorem 9 it follows that 
rank H^, R < dimi?. That is, the rank of H^^,n is not greater than the di- 
mension of the representation R. But from Theorem 10 it follows the the 
dimension of the representation R N is precisely rank H^, ;N>N and that Rn is 
an 2N + 1-partial representation of Hence, we get that dimR N < dim/?. 
In other words, R N is a minimal 2N + 1 representation of ^. 

Proof of Part 2 

We have just shown that the rational representation R N of Theorem 10 is a 
minimal 2N + 1-partial representation of ^F If R is another minimal 2N + 1- 
partial representation of then the dimension of R must be equal to that of 
i?jv, which in turn equals rank Hy^^. Hence, the second part of the statement 
is proved. Assume now that R is a minimal 2N+ 1-partial representation of ^F 
Then it means that Sf(w) = Sj(w) for each j G J and w G X- 2N+1 , where Sf 
denotes the element of ^/r indexed by the index j G J. If R is not reachable 
and observable, then it is not a minimal representation of ^/r. From Theorem 
9 it follows then that there exists a reachable and observable representation 
R m of ^ R such that dimR m < dimR. But if R m = (X rn , {A™} aeX , B m ,C m ) 
is a representation of ^/r, then it is a 2^ + 1-partial representation of since, 
Sj(w) = Sf(w) = C m A™B m for all j G J and w G X^ 2N+l . Hence, R is not 
a minimal 2N + 1-partial representation of ^, which is a contradiction. 

Proof of Part 3 

Suppose R and R are two minimal 2iV+ 1-partial representations of ^ . Recall 
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the definition of the families of formal power series ^r and ty R associated 
with R. Since R and R have to be reachable and observable by Part 2 of 
the current theorem, by Theorem 9 the representation R (respectively R) is 
a minimal representation of ^r (respectively ^ R )- We will show that ^r and 
\1/ R are equal, i.e. Sf = for all j G J. This in turn implies that R and 
are both minimal representations of the same family of formal power series, 
and hence by Theorem 9 they are isomorphic. 

The proof that ^r and ^ R are identical proceeds as follows. Recall that R 
and R are both of dimension rank Hy jN ,Ni and recall (B.34). Hence, it fol- 
lows that rank H^ t N,N = rank H-q, Rt N,N = H^ R = dimi? and rank H^^n,n = 
rank n,n = rank = dimi?. It also follows that ^r and ty R satisfy 
(B.7) with \1/ replaced by ^/r and ty R respectively. Denote by R N and R 2 N the 
minimal 2N + 1-partial representations of \5/r and respectively ^ R , obtained 
by applying Part 1 of Theorem 10 to ^r and "& R respectively. By Part 2 
of Theorem 10 the representations R N and R 2 N are minimal representations 
of ^ r and ^ R respectively. However the construction of R l N and R 2 N depends 
only on H^ Rt N,N+i and H^ At N,N+i- The latter two matrices both coincide with 
H-$,n,n+i- Hence, R N and R 2 N are both equal to the partial representation Rn 
obtained by applying Part 1 of Theorem 10 to That is, rational represen- 
tation R N , obtained by applying Part 1 of Theorem 10 to ^, is a minimal 
representation of both fy R and fy R . This implies that ^r and fy R coincide. 

We conclude with the proof of Theorem 12. 

PROOF. [Proof of Theorem 12] 
Proof of Part 1 

We have to show that Algorithm 3 returns a representation Rn, and Rn is 
isomorphic to Rn- By Theorem 10, if (B.ll) holds then Rn is an 2N + 1- 
representation of From Theorem 11 it then follows that Rn is a minimal 
2N + 1-partial representation. Hence, if Rn is an isomorphic copy of Rn, then 
we get that Rn is a minimal 2N + 1-partial representation of \F 

Recall from (B.24) in the proof of Theorem 10 the definition of the represen- 
tation Rn- Using the remark in Subsection 4.0.2, we will interpret the matrix 
O G R T ^ xr as a linear map O : W ->■ R 7 ^ 1 defined by (Ox), = £Li O i:k x k 
for all x G lR r . Using this interpretation, we define the map 

i = Vn,n+i ° ^ n \i °0 :W ^ Wy jN ,N = W*,jv,jv+i (B.35) 

Since ipN defines a representation isomorphism ipN '■ Rn — > Rn it is enough 
to show that £ defines a representation isomorphism £ : i?^ — > R N and that 
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Algorithm 3 returns a rational representation if (B.ll) holds. Indeed, in this 
case £ = ipN ° £, defines a representation isomorphism £ : R N — > R N . 

It is clear that £ is well defined. Indeed, ImO = lmHq, jN+1N by definition of 
matrix factorization. Moreover, O : W — > ]R /Ar+1 is an injective linear map and 

is a linear isomorphism. Furthermore, since 

rank Hy iN+ljN = dim Wy jN+ i iN = rank Hy iNjN = dim W^ :NjN 

we get that t)n+i,n : VJ^jv+i.jv — W^iv.Ar is a linear isomorphism. Thus, 
£ = t]n+i,n ° V ; jv+ i ° O is a well defined linear isomorphism. It is left to show 
that £ is a representation morphism from iiV to R^. 



It is easy to see that for all w G X- , and x G 



(d)( w ,i), (Ox) (u)i2 ), ••• (Ox) (WtP) 



(B.36) 



Recall from Subsection 4.0.2 that (Ox)( W)i ) stands for the value (entry) of 
Ore G R /jv+1 corresponding to the index (w,i) G ijv+i- In particular, for any 
(v,j) G J N , 

i(R., M ) = m(v o Sj) (B.37) 
where R.,( v ,j) stands for the column of R indexed by (v,j). Indeed, using (B.36) 
we get that for all w G X- N , 



i(R.,(vj))(w) 



(OR) 

(w,l),(v,j) 1 

(OR) (OR) 

(w,p),(v,j) 



(B.38) 



But i?*,jv+i,jv — OR, and hence the right-hand side of (B.38) equals the 
vector formed by the entries (i?*,jv+i,jv)(«),i),(i;,j) for i = 1,2..., p. But the 
entry (i?*,jv+i,jv)(u),i),(i;,j) equals the ith entry of Sj(vw), hence 

£(^>,;))M = = do ^-(w) 

From (B.36) it follows that 

C(i(x)) = i(x)(e) = Cx (B.39) 

From (B.37), it follows that 

i(Bj) = ritfiSj) = B 3 (B.40) 



It is left to show that a unique solution to equation (B.10) exists and 

AJ = £X (B.41) 
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for all a G X. First, notice that TR = H^,^^. Thus, rank TR = rank H^^,n = 
r, i.e. rank T = r. Hence, if a solution solution (B.10) exists, then this solu- 
tion is unique. Therefore, if we show that £ _1 is a solution to (B.10) then 
(B.41) automatically holds. Notice that £~ is a linear map from W to W 
and hence it can be identified with its matrix representation. By identifying 
£ -1 A (7 £ with its matrix, and using (B.37) and (B.25), it follows that for any 
(v,j) G J N and (w,i) G I N , 

(T^A^R)^^^^ = r^^X^A^R).^) = 
T^XC'aJr.^)) = T^J-'A^iv o Sj) = (B.42) 
f {wdJ^miva o Sj) 

Here we used the notation of Subsection 4.0.2 to denote rows and columns of 
matrices. In particular, M_^ v ^ denotes the column of a matrix M indexed by 
(v,j) and M( tUji ) v denotes the row of a matrix M indexed by (w, i). Notice that 
if w G X- N , then the row of F indexed by (w, i) equals the row of O indexed by 
(w,i). Moreover, if x = ^~ l {ri^{va o Sj)), then from (B.36) it follows that the 
row of 0(x) indexed by (w, i) equals the ith entry of rjN(vcroSj)(w) = Sj(vaw). 
Hence, the last expression of (B.42) can be rewritten as 

f ( w ,i),i _1 ^iv(wo- o Sj) = Od^rjNiva o Sj)) {W:i) = (Sj(vaw))i (B.43) 

where (Sj(vaw))i denotes the ith entry of Sj(vaw) G MP. On the other hand, 
the row of IV indexed by (w,i) equals the row of O indexed by (aw,i), if 
w G X^ N . Hence, 

(^aR)(w,i),(v,j) = (0)(aw,i),.R.,(v,j) = (Hy, N+1 , N )( aW! i),( v ,j) = (Sj(v<Tw))i (BAA) 

Combining (B.42), (B.43) and (B.44) we get that T^AjR = T a R. Since 
rank R = r, i.e the columns of R span the whole space W, the last equality 
implies that £~ is a solution to (B.10). 

Proof of Part 2 

From Theorem 10 it follows that if (B.12) holds, then (B.ll) holds and R N 
is a minimal representation of By Part 1 of this theorem, if (B.ll) holds, 
then Algorithm 3 returns a representation R^ and Rn is isomorphic to Rn- 
Hence Rn is a minimal representation too. 

Proof of Part 3 

Again, from Theorem 10 it follows that (B.12) holds in this case. The state- 
ment follows now from (B.12) and Part 2 of this theorem. 
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B.2.3 Proof of the auxiliary results 



We conclude the section with presenting the proof of Lemma 4 and Lemma 
5 used in the proof of Theorem 10. The proof relies on the following chain of 
results, which are interesting on their own right. 

Lemma 6 Let X be finite- dimensional vector space, divciX < N. Let A a : 
X^-X,aeXbea family of linear maps. Then for each y G X , for each 
w G X* , the vector A w y is a linear combination of the vectors A v y, for finitely 
many words v G X-^ -1 . 

PROOF. If \w\ < N, then the statement of the lemma is trivially true. First 
we prove the lemma for \w\ = N. Assume that w = o~\o~2 • • • o~n, ci, 02, . . . , o~n G 
X. Consider the elements A aia2 ... ai y, i = 0, . . . , N. Since every N + 1 elements 
of X are linearly dependent, we get that there exist i = 0, . . . , N such that 
A ai ... a .y is a linear combination of A ai ... a .y, j = 0,2, . . .,i — 1. Then we get 
that A w y = A ai+1 ... aN (A ai ... ai y) is a linear combination of vectors of the form 
A a - 1 ... a - j (r i+1 ...er N y, j — 0, ... ,i — 1. Notice that each word o~\ • • • <x,o- i+ i • • • un is 
of length at most N — 1. To prove the lemma for arbitrary \w\ > N we proceed 
by induction on \w\ — N. The case of \w\ = N we proved above. Assume that 
the statement of the lemma holds for \w\ < n + N . Assume now that w is of 
the form w = sa, where s E X* , \s\ < n + N and a £ X is the last letter of 
w. From the induction hypothesis it follows that A s y is a linear combination 
of vectors of the form A v y, v G X-^ -1 . But then A w y = A cr (A s y) is a linear 
combination of vectors of the form A a A v y = A va y. For each v G X-^ -1 , either 
va G X-^ -1 or \va\ = N. In the latter case, by induction hypothesis A va y is 
again a linear combination of vectors of the form A^y, v G X- N ^. Altogether, 
we get that A w y is a linear combination of the vectors A v y, v G X-^ -1 . 

The lemma above yields the following characterization of the reachability and 
observability subspaces (see Definition 25 and 26) of a rational representation. 

Corollary 3 Consider a p — J representation R = (X, {A a } ae x, B, C). As- 
sume that dimi? < X. With the notation of (A.J^-A.S), the following holds. 

R = H kerCA, 

vex< N -~L 

W R = Span{A v Bj | j G J,v G X^" 1 } 

If X = W 1 and J is finite, then the above corollary states that the observability 
subspace Or is the kernel of a finite matrix, and the reachability subspace W R 
is the image of a finite matrix. 
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PROOF. [Proof of Corollary 3] It is clear that R C (X eX <*-i kerCA v . We 
will show that the reverse inclusion holds as well. Assume that x G X is 
such that CA v x = for all v G X-^ -1 . By Lemma 6, for each w G X*, 
A w x is a linear combination of A v x for v G X-^" 1 . Hence, CA w x is a linear 
combination of CA v x, v G X-^ -1 and the latter implies that CA w x = 0. That 
is, n^ex<~-i kerCA, C R . 

Similarly, it is clear from the definition that Span{A v Bj \ v G X- N_1 ,j G 
J} Q Wr. The reverse inclusion follows from Lemma 6, according to which 
for each word w G X*, A w Bj is a linear combination of the vectors A v Bj, 
v G X^- 1 . Hence, we get that W R C Span^B, \ v G X^ N ~\j G J}. 

Finally, we will present the proof of Lemma 4 and Lemma 5. 

PROOF. [Proof of Lemma 4] We will use the fact that dim Wq, = rank Hq, < 
N + 1. Consider the free representation Rq, = (W^, {A a } a€ x, B, C) of ^ de- 
fined in Theorem 8. Notice that for any word X*, w o Sj = A w Bj. Apply 
Lemma 6 to W^, A a : Wq, — > Wq,, a G X and y = Bj. Then we get that 
for any word w G X*, the formal power series A w Bj = w o Sj is a linear 
combination of the formal power series A v Bj — v o Sj with v G X- N . 

PROOF. [Proof of Lemma 5] It is easy to see that i]m is a surjective linear 
map. Consider the free realization Rq, = (Wq,, {A a } aeX , B, C) of From 
Theorem 9 we know that Rq, is minimal and therefore it is reachable and 
observable, i.e. R<S = {0}. From Corollary 3 we also know that if dimi?^ = 
dim IF* = rank Hq, < M + 1, then RxB = p\ v&x < M ker C A v . Consider the 
kernel of i]m- For any formal power series S G IF*, i]m{S) = if and only if 
S(w) = for all w G X^ M . Hence, CA W S = S(w) = for each w G X^ M , i.e. 
S G R y = {0}. Thus, r] M is injective. 
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